The Test of English as a Foreign Language (TOEFL)
measures the English language proficiency of foreign students who
apply for higher education in the United States. It is administered
according to a strict schedule through the International and Special
Center (I&SC) testing program or at ad hoc intervals by institutional
users as part of the TOEFL Institutional Testing Program (INST). This
study investigated the patterns of test taking and score change for
examinees who repeated the TOEFL in an International or I&SC
administration within 24 to 60 months after initial testing. Data for
examinees who repeated TOEFL within 1 to 12 months following initial
testing by institutions participating in the TOEFL INST were also
analyzed. In both the I&SC and INST samples, TOEFL repeaters
registered substantial average net gains in performance. Findings
suggested that there may be differences among national-linguistic
groups in typical rate of acquisition of proficiency in English. The
magnitude of the average score changes found to be associated with
test repetition for repeaters, combined with relatively strong
differences among national-linguistic groups in both incidence of
test repetition and mean score change, indicates that for purposes of
summary reporting on TOEFL performance (e.g., in defining basic
reference groups), it is important to specify the test-repetition
status of the groups of examinees involved. (JAl)
The Test of English as a Foreign Language (TOEFL) was developed in 1963 by a
National Council on the Testing of English as a Foreign Language, which was formed
through the cooperative effort of over thirty organizations, public and private, that were
concerned with testing the English proficiency of nonnative speakers of the language
applying for admission to institutions in the United States. In 1965, Educational Testing
Service (ETS) and the College Board assumed joint responsibility for the program and
in 1973 a cooperative arrangement for the operation of the program was entered into
by ETS, the College Board, and the Graduate Record Examinations (GRE) Board. The
membership of the College Board is composed of schools, colleges, school systems,
and educational associations; GRE Board members are associated with graduate
education.

ETS administers the TOEFL program under the general direction of a Policy Council
that was established by, and is affiliated with, the sponsoring organizations. Members
of the Policy Council represent the College Board and the GRE Board and such
institutions and agencies as graduate schools of business, junior and community
colleges, nonprofit educational exchange agencies, and agencies of the United States
government.



A continuing program of research related to TOEFL is carried out under the direction
of the TOEFL Research Committee. Its six members include representatives of the
Policy Council, the TOEFL Committee of Examiners, and distinguished
English-as-a-second-language specialists from the academic community. Currently the committee
chair of the committee serves on the Policy Council.
in the teaching of English as a foreign or second language. Representatives of such
programs who are interested in participating in or conducting TOEFL-related research
are invited to contact the TOEFL program office. Local research may sometimes
require access to TOEFL data. In such cases, the program may provide these data
following approval by the Research Committee. All TOEFL research projects must
undergo appropriate ETS review to ascertain that the confidentiality of data will be
protected.

Current (1986-87) members of the TOEFL Research Committee include the following:
Abstract

This study was principally concerned with the investigation of patterns of
test taking and score change for examinees who repeated the Test of English as
a Foreign Language (TOEFL) in International or Special Center testing
(I&SC) administrations 24 to 60 months after initial testing. Data for
examinees who repeated within 1 to 12 months following initial testing
by institutions participating in the TOEFL Institutional Testing Program
(INST) were also analyzed. In both the I&SC and INST samples, repeaters
registered substantial average net gains in performance. Findings suggested
that there may be differences among national-linguistic groups in typical rate
of acquisition of proficiency in English. The magnitude of the average score
changes found to be associated with test repetition for repeaters, combined
with relatively strong differences among national-linguistic groups in both
incidence of test repetition and mean score change, indicates that for
purposes of summary reporting on TOEFL performance (e.g., in defining basic
reference groups), it is important to specify the test-repetition status of
the groups of examinees involved.
This study was made possible by the support of the TOEFL Research Committee.
Tdti»td Itarrlsw performed the complex task of collecting TOEFL
records for examinees tested in International or Special Center administrations
over a five-year period, and he also programmed basic analyses of data
for these examinees. Sti^en VBil carried out similar tasks connected with the
analysis of data for examinees tested as part of the TOEFL Institutional
Testing Program. Bwmnt Bic^ Bmntmi, and Charles Stansfield
read and made comments on various versions of the manuscript. These contributions
are acknowledged with appreciation. However, the writer assumes responsibility
for the contents of this report.
Patterns of Test Taking and Score Change for Examinees Who Repeat
the Test of English as a Foreign Language

Kenneth M. Wilson
Educational Testing Service

Each year, thousands of nonnative speakers of English planning
undergraduate or graduate study in the United States take the Test of English as a
Foreign Language (TOEFL) in order to demonstrate their level of developed
proficiency in English. TOEFL provides separate measures of listening
comprehension (LC), structure and written expression (SWE), and vocabulary and
reading comprehension (VRC).

TOEFL is offered in scheduled, strictly controlled monthly administrations
through the International and Special Center (I&SC) testing program. The
test is also administered at ad hoc intervals by institutional users as part
of the TOEFL Institutional Testing Program (INST), under conditions controlled
by individual institutions. The great majority of examinees are tested in
I&SC administrations.

Many of these individuals take TOEFL more than once; that is, they are
repeater examinees. Based on a study of the characteristics and test
performance of I&SC examinees tested during 1977-1979 (Wilson, 1982), it was
estimated that perhaps as many as one third of TOEFL examinees repeat one or
more times. Incidence of test repetition was found to vary considerably by
country of origin, ranging from less than 10 percent to over 50 percent, and
national-linguistic contingents with higher percentages of repeating examinees
tended to be those with lower mean TOEFL scores. However, the study did not
examine questions regarding characteristic patterns of test taking and score
change for the examinees who repeated.



Some Working Assumptions Regarding Test Repetition and
Score Change for TOEFL I&SC Repeaters

It is assumed that first-time I&SC test takers whose English proficiency
as reflected in scores on TOEFL falls below some personal or external
criterion level will continue to take TOEFL periodically until they either meet the
criterion level or, failing to do so, drop out of the TOEFL population. It is
also assumed (a) that between testings they will attempt to improve their
proficiency in English through formal and/or informal study/practice and (b)
that, on the average, some improvement in English proficiency is likely to
occur among repeating examinees between test administrations, improvement that
will be reflected in average increases in TOEFL scores. Increased
"test-wiseness" and familiarity with the specific test format, statistical factors
such as regression to the mean, and so on, are factors that should be kept in
mind in evaluating observed changes.
Study Objectives

The study was designed primarily to obtain answers to questions such as
the following regarding test taking behavior and "long-term" score change for
cohorts of first-time examinees in International and Special Center (I&SC)
test administrations:

(a) What are the test-taking patterns of cohorts of first-time I&SC
test takers from different native country and language groups? What is the
incidence of 2-time, . . . , t-Time test taking?

(b) What are the patterns of change in mean TOEFL performance for
repeating examinees by number of times tested and by country of origin?

(c) What is the relationship between change in TOEFL score and variables
such as time interval between the initial and last test administrations
(within the study period), number of times tested, age, sex, test center (U.S. vs
other), educational level (undergraduate vs graduate), and so on?

(d) Which of these variables contribute most to prediction of last-time
score after taking first-time score into account?



Study Data

The study focused primarily on data for examinees who were tested for the first
time in a scheduled, monthly TOEFL International and/or Special Center test
administration between July 1977 and June 1980, and had accumulated at least
one additional test record in an I&SC test administration as of June 1982.
The follow-up period, ranging from a minimum of two years to a maximum of five
years following initial testing, was deemed to be long enough to cover the
tenure of most first-time test takers as "potential participants in an I&SC
test administration."

Analysis of long-term test-taking behavior and score change for I&SC
repeaters was supplemented by analysis of data from TOEFL Institutional Testing
Program files for a sample of approximately 10,000 individuals who were
identified as first-time institutionally-tested TOEFL examinees, with testing
dates between July 1984 and August 1985. For these INST examinees, the maximum
time interval for test repetition was about 12 months. The supplementary
analysis provided information regarding short-term change in TOEFL performance
for individuals presumed to have been enrolled in programs of instruction in
English as a second language (ESL) between test administrations.



Procedure

Thirteen analysis groups, based primarily on national origin and
associated linguistic background, were defined for the study. These groups
represented a relatively wide range of average performance on TOEFL, incidence of
self-reported test repetition, and/or language groups:
Groups known to be characterized by higher incidence of repetition

01 Taiwan
02 Hong Kong
03 Korea
04 Thailand
05 Japan

Groups known to be characterized by medium incidence of repetition

07 Saudi Arabia, Kuwait, Lebanon, Iraq, Jordan, Syria
08 Iran
09 Chile, Colombia, Mexico, Peru, Venezuela
12 Greece, Turkey

Groups known to be characterized by lower incidence of repetition

06 India
10 Germany, Netherlands, Denmark, Norway, Sweden
11 France, Italy, Spain, Portugal
13 Ghana, Nigeria

In the case of group 12 (Greece, Turkey), grouping was based on strong
similarities in typical level of performance on TOEFL, incidence of
self-reported test repetition, and examinee characteristics other than language
(Wilson, 1982). These same groups were employed in analyzing the data on
institutionally tested examinees.

For each I&SC analysis group, the study examined (a) the test-taking
patterns of first-time examinees (e.g., the number of 1-time, 2-time, . . . ,
t-Time test takers), (b) the mean change in TOEFL total performance for
repeaters generally, and by number of times tested, and (c) mean changes in
TOEFL section scores.

For a 20 percent sample of I&SC repeaters, the study examined the
relationship between change in TOEFL total score and selected nontest variables:
time interval between the initial test administration (t-1) and the last test
administration (t-T, T = 2, 3, . . . , T), number of times tested, location of
the last test center (U.S. vs other), educational level (graduate vs
undergraduate), sex, age, and score reporting at time of initial testing (reported
TOEFL scores to institutions vs did not do so).

Multiple regression analysis was used to assess the relative contribution
of these nontest variables, in combination with initial TOEFL total score, to
prediction of the last score of record. The total-sample regression equation
was used to compute an expected t-T total score mean for each analysis group.
Discrepancies between observed and expected means for the respective analysis
groups were computed and evaluated. An evaluation was made of trends across
analysis groups in incidence of test repetition among first-time examinees
classified according to score level on TOEFL.

Where feasible and appropriate, parallel procedures were used in
analyzing the data for institutionally tested examinees.



Findings Regarding I&SC Repeaters

Test Taking Patterns

Some 28 percent of all first-time examinees between July 1977 and June
1978 had two or more test records in the file as of June 1982. Between 40 and
50 percent of examinees from Taiwan, Hong Kong, Korea, Thailand, and Japan
were repeaters; between 19 and 24 percent of those from Iran, Arabic-speaking
Mideastern countries, South America and Mexico, Greece, and Turkey repeated;
less than 10 percent (between 4 percent and 9 percent) of examinees from
India, Ghana and Nigeria, and the European-Germanic and European-Romance
groups had more than one test record by the end of the study period.

A majority of I&SC repeaters were one-time repeaters. However, some 12
percent of all test takers were "multiple repeaters"; that is, they took TOEFL
three or more times. Incidence of multiple repetition was higher (over 20
percent) for the Asian groups, which had high general incidence of test
repetition, and lower for the major European, Indian, and African
contingents (less than 1.5 percent). See Tables S-1 and S-2 for data on the
number and percentage of 1-time, 2-time, . . . , T-time test takers, by
analysis group.

Repeating examinees, who accounted for only 28 percent of all first-time
test takers, accounted for more than one half (about 53 percent) of all test
records in the study file for the entire cohort of first-time test takers.
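
The link between the share of repeaters and the share of test records they generate can be checked with a short calculation. The Python sketch below is an illustration only, not the study's procedure. It uses the Total row of Table S-1 and rests on two assumptions that are not in the source: the partly illegible 7-times count is read as 997, and every examinee in the "11+" column is counted as having exactly 11 records, so the record share is slightly understated.

```python
# Total row of Table S-1: first-time examinees by number of test records.
# Assumptions (not from the source): the 7-times count is read as 997, and
# the "11+" column is treated as exactly 11 records per examinee.
counts_by_times_tested = {
    1: 160401, 2: 34194, 3: 13542, 4: 6138, 5: 3260, 6: 1777,
    7: 997, 8: 612, 9: 351, 10: 174, 11: 281,
}


def repeater_shares(counts):
    """Return (share of examinees who repeated, share of records they generated)."""
    examinees = sum(counts.values())
    records = sum(times * n for times, n in counts.items())
    repeaters = examinees - counts[1]
    # One-time test takers contribute exactly one record each.
    repeater_records = records - counts[1]
    return repeaters / examinees, repeater_records / records


examinee_share, record_share = repeater_shares(counts_by_times_tested)
print(f"repeaters: {examinee_share:.1%} of examinees, {record_share:.1%} of records")
```

Under these assumptions the calculation gives roughly 28 percent of examinees and 53 percent of records, in line with the figures reported above.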



Patterns of Change in Mean Score

In every analysis group, average change in TOEFL total score increased
with number of times tested; the amount of change associated with additional
testing varied across groups (see Figure S-1).

The first-time TOEFL total means for repeaters classified according to
number of times tested during the study period (2, 3, . . . , T-times) varied
systematically. Mean time-1 TOEFL total score decreased as number of
times tested increased. Two-time test takers had higher initial (t-1) means
than 3-time test takers, 3-time test takers had higher t-1 means than 4-time
test takers, and so on. However, the final (t-T) means of the respective
times-tested subgroups were very similar; that is, 2-time, 3-time, 4-time, . . . ,
t-Time test takers had comparable average time-T total scores. Mean t-T
TOEFL total scores typically approached or slightly exceeded 500 in groups
with t-1 scores averaging considerably lower than 500. Such regularity
suggests that there may be a shared perception of the "acceptable" score level
on TOEFL.

Stability of TOEFL Scores

The correlation between the initial and the last observed TOEFL total
Table S-1

Distribution of First-time Test Takers, July 1977 through June 1980, According to Total Number of Test Records
Accumulated as of June 1982, by Analysis Group

Group      (N)         1      2      3     4     5     6     7     8     9    10   11+

05      (14308)     7158   3134   1710   910   545   318   187   114    96    43    93
06      (19158)    17442   1425    201    52    17     8     5     3     2     2     1
07      (20839)    16562   2535    992   383   192    88    41    23    11     3     9
08      (31605)    23894   4779   1642   704   352   126    57    22    17     5     7
09      (15021)    12112   1985    60?   195    ??    25    1?           2     1     3
10      ( 6449)     6207    225     15
11      ( 6131)     5???    443     54    10
12      ( 7461)     5857   1084    312   109    48    36     9     1     1
13      (20886)    19621   1125    118    20     1     1

Total  (221744)   160401  34194  13542  6138  3260  1777   99?   612   351   174   281



Table S-2

Percentage Distribution of First-time Test Takers, July 1977 through June 1980, According to Total Number
of Test Records Accumulated as of June 1982, by Analysis Group

                    Percent tested designated number of times

Group   No. tested     1     2     3     4     5     6     7     8     9    10   11+

01      (36950)     56.4  23.0  10.3   4.6   2.4   1.4   0.8   0.5   0.3   0.1   0.2
02      (23278)     5?.4  20.3   9.1   4.7   2.?   1.9   1.2   0.?   0.4   0.2   0.3
03      (10178)     57.5  22.6  10.7   4.3   2.3   1.0   0.?   0.3   0.1     *
04      ( 9480)     59.5  20.5   9.3   4.9   3.0   1.3   0.7   0.4   0.2     *   0.2
05      (14308)     50.0  21.9  12.0   6.4   3.8   2.2   1.3   0.8   0.7   0.3   0.6
06      (19158)     91.0   7.4   1.0   0.3   0.1     *     *     *     *     *     *
07      (20839)     79.5  12.2   4.8   1.8   0.9   0.4   0.2   0.1   0.1           *
08      (31605)     75.6  15.1   5.2   2.2   1.1   0.4   0.2   0.1   0.1     *     *
09      (15021)     80.6  13.2   4.0   1.3   0.5   0.2   0.1     *     *     *     *
10      ( 6449)     96.2   3.5   0.2     *
11      ( 6131)     91.7   7.2   0.9   0.2   0.2
12      ( 7461)     78.5  14.5   4.2   1.5   0.6         0.1     *     *
13      (20886)     93.9   5.4   0.6   0.1     *     *

Total  (221744)     72.3  15.4   6.1   2.8   1.5   0.8   0.4   0.3   0.2   0.1   0.1

Note: Analysis groups are 01 (Taiwan), 02 (Hong Kong), 03 (Korea), 04 (Thailand), 05 (Japan), 06 (India), 07
(Saudi Arabia, Kuwait, Lebanon, Libya, Jordan, Syria, Iraq), 08 (Iran), 09 (Venezuela, Mexico, Colombia, Peru,
Chile), 10 (Germany, Netherlands, Denmark, Norway, Sweden), 11 (France, Spain, Italy, Portugal), 12 (Greece,
Turkey), 13 (Ghana, Nigeria).
Figure S-1. Plot of differences between t-1 and t-T TOEFL total score means
for repeaters by number of times tested: by analysis group
scores, over t-1 to t-T time intervals averaging about one year, was .72 based
on data for a 20 percent sample of repeaters without regard to analysis group.
This may be compared with internal consistency reliability coefficients of .9
that are routinely obtained for this score and a test-retest coefficient of .9
that has been reported for a sample of examinees retested with an alternate
test form within one week. Coefficients reflecting the relative standing of
examinees, t-1 to t-T, on TOEFL sections were .56, .63, and .70, for Listening
Comprehension, Structure and Written Expression, and Reading Comprehension and
Vocabulary, respectively. Listening comprehension was the least stable and
reading comprehension and vocabulary the most stable of the component skills
measured by the TOEFL.

These findings, as well as those regarding patterns of change in mean
scores, are consistent with the assumption of growth in the skills being
measured by the TOEFL for I&SC repeaters between initial and final testing.



Selected Correlates of Total Score Change

In analyses based on a 20 percent sample of repeaters, the same set of
four nontest variables was found to contribute most to prediction of change in
TOEFL total score between t-1 and t-T in analyses with and without control for
t-1 score. These were number of times tested (Numtests), time interval in
months between t-1 and t-T (Interval), location of test center in the U.S. vs
other location (Center-U.S.), and age. With some variation in detail, the
pattern of findings was similar across I&SC analysis groups.

Based on analyses in the total 20 percent sample, weights reflecting the
relative contribution (in standard units) of the seven nontest variables to
prediction of total net change, when t-1 score was controlled, were as follows:

Variable                                          Standardized
                                                  regression weight

Interval in months                                      .152
Center (U.S.,1; other,0)                                .142
Number of times tested                                  .115
Age at t-1                                             -.119
Sex (male,1; female,0)                                 -.065
Educational level                                       .016
  (graduate,1; undergraduate,0)
Reported scores at t-1                                 -.008
  (yes,1; no,0)

These findings are consistent with the assumption of growth in English
proficiency for repeaters: for example, more time and "effort" (which "times
tested" may be thought of as reflecting, at least in part) led to greater
change. Other things being equal, younger repeaters and those tested in the
U.S. tended to gain more, on the average, than did their older counterparts
tested elsewhere.
The weighting of the Center-U.S. variable is consistent with the logical
proposition that being in a native-English speaking environment is conducive
to improvement in English proficiency.



Predicting t-T Total Score Using t-1 Score and Nontest Variables

In the total 20 percent sample of repeaters (N = 11,612), an equation was
developed (with raw score weights) for predicting t-T score, using t-1 total
score and scores on the several nontest variables as predictors:

t-T (predicted) = .737 (t-1) + 3.416 (Numtests) + .681 (Interval)

+ 12.637 (Center-U.S.) + 1.569 (Edlevel) - 1.107 (Age)
- 5.877 (Sex) - .647 (Reports) + 167.391.
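
As a worked illustration of how the raw-score equation is applied, the Python sketch below evaluates it for a single hypothetical examinee. It is a minimal sketch, not the study's software. The function and argument names are invented for the example, the 0/1 coding follows the variable definitions given with the standardized weights, age is assumed to be in years, and the Edlevel weight is taken as 1.569 as printed here (the note to Table S-3 prints 1.369).

```python
# Raw-score regression weights as printed in the equation above.
# Assumption: the Edlevel weight is 1.569 (the Table S-3 note prints 1.369).
WEIGHTS = {
    "t1": 0.737,          # initial (t-1) TOEFL total score
    "numtests": 3.416,    # number of times tested
    "interval": 0.681,    # months between t-1 and t-T
    "center_us": 12.637,  # last test center: U.S. = 1, other = 0
    "edlevel": 1.569,     # graduate = 1, undergraduate = 0
    "age": -1.107,        # age at t-1 (assumed to be in years)
    "sex": -5.877,        # male = 1, female = 0
    "reports": -0.647,    # reported scores at t-1: yes = 1, no = 0
}
INTERCEPT = 167.391


def predict_tT(**predictors):
    """Return the predicted t-T total score for one set of predictor values."""
    missing = set(WEIGHTS) - set(predictors)
    if missing:
        raise ValueError(f"missing predictors: {sorted(missing)}")
    return INTERCEPT + sum(WEIGHTS[name] * predictors[name] for name in WEIGHTS)


# Hypothetical examinee, not taken from the study data.
predicted = predict_tT(t1=450, numtests=3, interval=12, center_us=1,
                       edlevel=1, age=24, sex=0, reports=1)
print(round(predicted, 1))
```

The same function can be applied to a group's predictor means to obtain an expected group mean, which is how the equation is used in the comparison that follows.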

This equation was applied to the predictor means for the 13 analysis
groups to obtain predicted t-T means. Table S-3 shows the observed and
predicted t-1 and t-T means, and the discrepancy between the two, by group.
Observed t-T means were 10 or more points higher than predicted for four
analysis groups: namely, the European-Romance, South American and Mexican,
Greek and Turkish, and European-Germanic repeaters. The average repeater in
these four groups had t-T scores that were 10 or more points better than
expected, taking into account the t-1 score and scores on all the nontest
variables under consideration. Examinees from Taiwan and Thailand had
observed t-T means that were some 4 to 7 points lower than expected.

These results may be due to differences among the analysis groups in the
amount, intensity, and/or quality of the total English-language intervening
experience for the repeating examinees. Such findings may also reflect, to
some extent, differences in degree of linguistic distance between English
and the various languages involved, and associated differences in usual rates
of acquisition of proficiency in English as a second language.



Tendency to Repeat, by Initial Score Level



It was assumed at the outset that the tendency to repeat TOEFL varies
inversely with initial score level. Findings bearing most directly on this
assumption are summarized in Figure S-2, which plots the percentage of
repeaters among first-time test takers classified in 20-point t-1 TOEFL total-score
intervals, for several consolidated analysis groups.
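
The classification behind such a plot can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the study's code: the records are invented, the function name is hypothetical, and intervals are assumed to start at multiples of 20 (for example 500-519), which matches the interval cited in the discussion of the figure.

```python
from collections import defaultdict


def repeat_rate_by_interval(records, width=20):
    """Percentage of repeaters per t-1 score interval.

    records: iterable of (t1_score, number_of_test_records) pairs.
    Returns {interval_start: percent of examinees with two or more records}.
    """
    totals = defaultdict(int)
    repeaters = defaultdict(int)
    for t1_score, n_tests in records:
        start = (t1_score // width) * width  # e.g. 517 falls in the 500-519 interval
        totals[start] += 1
        if n_tests >= 2:
            repeaters[start] += 1
    return {s: 100.0 * repeaters[s] / totals[s] for s in sorted(totals)}


# Invented records for illustration only.
sample = [(503, 1), (517, 2), (500, 1), (519, 3),
          (455, 2), (447, 1), (441, 1), (459, 1)]
rates = repeat_rate_by_interval(sample)
print(rates)
```

Running the same tabulation separately for each analysis group would give one curve per group, which is the form of comparison the figure presents.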

From Figure S-2, it is evident that the tendency to repeat (a) was not a
simple linear inverse function of initial score level in any analysis group,
and (b) was strongly associated with analysis group membership, per se.

Within analysis groups, the inverse relationship tended to hold only for
first-time test takers with higher initial scores; the percentage of repeaters
actually tended to decrease with score level for first-time test takers with
lower initial scores.

Figure S-2 indicates clearly that the tendency to repeat was strongly
related to analysis group membership. For example, at a t-1 score level of
Table S-3

Discrepancies Between Observed and Expected Time-T TOEFL Total Scores,
by Analysis Group, When Expectation is Based on an All-Repeater Regression
Equation Involving Time-1 Score and Seven Nontest Variables*

                       (1)         (2)         (3)
                     Time 1      Time-T      Time-T
                     actual      actual      expected
Group        N       (mean)      (mean)      (mean)      (2) - (3)

11          101      530.7       566.9       542.9        + 24.0
09          566      465.8       513.9       501.5        + 12.4
12          307      475.0       518.0       507.6        + 10.4
10           48      531.7       555.7       544.1        + 11.6
08         1500      434.4       488.6       483.0        +  5.6
03          819      495.5       521.1       518.8        +  2.3
06          313      501.2       526.0       526.0        +  0.0
13          214      490.9       517.9       518.7        -  0.8
05         1346      470.8       503.1       504.2        -  1.1
02         1781      489.5       519.0       520.1        -  1.1
07          814      439.7       485.4       487.3        -  1.9
01         3070      490.7       508.8       512.7        -  3.9
04          733      454.4       485.1       492.5        -  7.4

Total     11612      474.6       506.8       506.8           0.0

Note: Data are based on a 20 percent sample of all repeaters.
Groups, ordered in terms of actual minus expected means, are 11
(France, Italy, Portugal, Spain), 09 (Chile, Colombia, Mexico, Peru,
Venezuela), 12 (Greece, Turkey), 10 (Germany, Netherlands, Denmark,
Norway, Sweden), 08 (Iran), 03 (Korea), 06 (India), 13 (Ghana,
Nigeria), 05 (Japan), 02 (Hong Kong), 07 (Saudi Arabia, Kuwait,
Lebanon, Iraq, Jordan, Syria), 01 (Taiwan), 04 (Thailand).

* Expected t-T = .737 (t-1) + 3.416 (Numtests) + .681 (Intrvl)

+ 12.637 (Center) + 1.369 (Edlevel) - 1.107 (Age)
- 5.877 (Sex) - .647 (Reports) + 167.391.
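
The discrepancy column can be recomputed directly from columns (2) and (3). The short Python sketch below does so and picks out the groups whose observed mean exceeds the expected mean by 10 or more points. It is offered as a check on the arithmetic, not as part of the original analysis. The names are illustrative, and the observed mean for group 05 is taken as 503.1, the value implied by its expected mean and printed discrepancy.

```python
# (group, observed t-T mean, expected t-T mean), copied from Table S-3.
# Assumption: the group 05 observed mean is 503.1 (partly illegible in the
# source; implied by the expected mean of 504.2 and the discrepancy of -1.1).
TABLE_S3 = [
    ("11", 566.9, 542.9), ("09", 513.9, 501.5), ("12", 518.0, 507.6),
    ("10", 555.7, 544.1), ("08", 488.6, 483.0), ("03", 521.1, 518.8),
    ("06", 526.0, 526.0), ("13", 517.9, 518.7), ("05", 503.1, 504.2),
    ("02", 519.0, 520.1), ("07", 485.4, 487.3), ("01", 508.8, 512.7),
    ("04", 485.1, 492.5),
]


def discrepancies(rows):
    """Observed minus expected t-T mean per group, rounded to one decimal."""
    return {group: round(observed - expected, 1) for group, observed, expected in rows}


disc = discrepancies(TABLE_S3)
over_10 = sorted(group for group, d in disc.items() if d >= 10.0)
print(over_10)
```

The four groups returned (09, 10, 11, 12) are the South American and Mexican, European-Germanic, European-Romance, and Greek and Turkish groups named in the text.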
Figure S-2. Percentage of repeaters by initial score
level on TOEFL for consolidated analysis groups
(20 percent sample)

[Plot not reproduced. Horizontal axis: midpoint of t-1 TOEFL total score
interval, shown as TOEFL total score divided by 10 (33 through 59).]

Entries in ( ) are original analysis groups (see Figure S-1).
500-519, the percentages of repeaters among first-time test takers in the
Far-Eastern contingents were almost six times greater than the percentage
repeating among first-time test takers in the Indian, African, and major European
contingents ("Europe" includes analysis groups 10 and 11; see page S-3).



Evaluation of Findings Regarding I&SC Repeaters

The I&SC repeater findings indicate substantial variation among groups of
examinees differing in national origin and linguistic background with respect
to percentage of repeaters, incidence of multiple repeaters (examinees tested
three or more times), and average net gain between testings.

Generally speaking, it is reasonable to infer from the study findings (a)
that, when administered to nonnative speakers, TOEFL is measuring developing
English-proficiency-related abilities and skills, and (b) that, on the
average, the examinees who repeat TOEFL in International and Special Center
test administrations experience real improvement in those skills over time-1
to time-T intervals averaging approximately one year.

The study did not control for specific intervening-experience variables
such as the nature, amount, duration, intensity, and/or overall "quality" of
exposure to the English language. Accordingly, the observed mean changes in
TOEFL score should be thought of as indicating net gains without regard to the
nature of intervening experience, that is, gains that may be expected to occur
for similar groups of I&SC examinees under the range of conditions (a) that
prevail between retestings for repeaters in International and Special Center
test administrations and (b) that ordinarily influence the tendency of
examinees to repeat the test in an I&SC administration.

Time interval between initial and final testings and number of times
tested (reflecting "effort," at least in part, plus increased "test-wiseness,"
financial resources, and so on) were correlated positively with change, as was
being in the United States at time of final testing (a variable thought of as
reflecting degree of immersion in English); age was negatively correlated with
change. The other nontest variables (sex, educational level, and reporting
versus nonreporting of time-1 scores to institutions) contributed only
slightly to the prediction of change or time-T total score.

Study findings indicate that the tendency to repeat TOEFL is not a simple
inverse linear function of initial score level. Rather it appears to be
strongly related to factors associated with linguistic-cultural background:
for first-time examinees at comparable score levels on TOEFL, the tendency to
repeat varied markedly across analysis groups. Why this should be so is not
clear. Further study would appear to be warranted.

During the period covered by this study, a minority of first-time I&SC
examinees who repeated in I&SC administrations generated a majority of
all test records: repeaters generated more than half of all test records
accumulated by the cohorts of test takers included in the study file, although
they represented only 28 percent of all first-time test takers in these
cohorts. The concentration of multiple test records varies by country of
origin. The magnitude of the average score changes associated with test
repetition and the presence of relatively large differences among
national-linguistic groups in mean score change suggest that, for purposes of summary
reporting of the TOEFL performance of various groups, it is important to
specify the test-repetition status of the examinees in various reference
groups.

Sy^lenaentary Finings s Short-^ltem Score aiange for Repeaters 

ttsted by institutions 

^ inferCTCe^ rest ins tituti ms that participate in the TOEFL Institu- 
tiOTal testing progrOT ^ TOEH* for facing studteiti in progr^ of mL 
instruction an4/or for ^mluating U»ir progress after instnicticn* Accord- 
ii^ly^ it is reasoo^le to asiime that nost exaniitees %«Aio repeat the ' iKMMt^ 
isider InstitutiOTal au^ires are m^ged in fontBl ES^ progr^^ ari that 
(Served average diai^s in TOmi for s^les of imT repeaters may be thou^t 
of as average gains associated mth tte ^ical raige of instruction in 
ingli^ over givm periods of tinie. 

To rbtain infcmation re^rdlng short-term score diange for repeating 
ex^TOes in the institaitim^ testing progran, steplanento^ Malyses \^ere 
urriert^en, some 10,000 individuals were identified as having tak«i TCm^ for 
the first time in an instituticml test adrdnistration between iJoly 19B4 and 
^gust 1985. .toout 10 perrent of ^se Individuals accmmalated at least cne 
additicnal test record during that period; of the re^aters, ^ut 80 percent 
had OTly ^ additioi^ record in the staa^ file for Ute period imder omsi(^ 
eratim* tte vmm msifcer of tests per repeater v?as 2*4, ad the average tiro 
interval betwem the initial (time 1 or t-1) test adninistration aid thB last 
d^rved record (timi T or t-T) ms 4.2 mmths* Sotta 85 percent of the 
repeaters vmrm tested by institutions located in tte mited States* 

Both first-time test takers generally and those who repeated during the
study period had lower TOEFL scores than did the I&SC examinees studied,
consistent with the assumptions regarding the enrollment of institutional
examinees in ESL instruction; I&SC examinees include individuals at all
levels of English proficiency.

For 954 INST repeaters with data on all study variables, the average amount
of change in TOEFL total score, over t-1 to t-T intervals averaging slightly
more than four months, was 36 scaled score points. The nontest variables
contributing most to prediction of t-T TOEFL total score (and by inference
to change, t-1 to t-T) for the INST repeaters were time interval between t-1
and t-T and being enrolled in a U.S. rather than a non-U.S. institution when
tested.

As was true for the I&SC repeaters, mean change in TOEFL total was greater
for repeaters in some national-linguistic analysis groups than in others.
Whether based on mean change in TOEFL total score or the mean difference
between observed and expected t-T scores, the ranks of national-linguistic
groups among INST repeaters were found to correspond closely to the ranks
for the same national-linguistic groups among examinees identified as I&SC
repeaters.

These consistencies in patterns of differences among analysis groups were
found even though most of the INST analysis groups were quite small and the
circumstances associated with test repetition were not comparable for
institutionally tested and I&SC-tested repeaters. Such consistencies
strengthen an inference that there are differences among national-linguistic
groups in characteristic rate of acquisition of proficiency in English as a
second language, at least as measured by the TOEFL.

To the extent that the institutional repeaters were taking special
instruction in English as a second language as assumed, the pattern of
differences in average gains in TOEFL performance for INST analysis groups
in these samples may be thought of as indicative of the pattern of
differences that might be expected for similar groups of examinees who may
participate in "typical" programs of special ESL instruction, under
"typical" program conditions, over time intervals averaging about four
months. Of course, the fact that most of the analysis groups were relatively
small limits the accuracy of inferences from these data regarding specific
average gains to be expected for various groups.

Research is needed to obtain more detailed information regarding the actual
circumstances of individuals who repeat the TOEFL in institutional test
administrations, the characteristics of instructional programs involved, the
conditions of test administration, and so on. Assuming the availability of
such information, research involving test-retest data from TOEFL
institutional program files could be conducted with ESL programs as units of
analysis, with control for, say, duration of program, patterns of
instruction, emphasis on English for general purposes as opposed to English
for specific academic or occupational purposes, and so on. A capability to
track individuals between I&SC and INST test administrations, if developed,
would make possible greater control over the previous experience of
individuals in studies concerned with growth in English proficiency among
institutionally tested examinees.

Patterns of Test Taking and Score Change for Examinees Who Repeat
the Test of English as a Foreign Language

Kenneth Wilson
Educational Testing Service



Section I. Introduction

Each year, thousands of non-native speakers of English (largely
international students planning undergraduate or graduate study in the
United States) take the Test of English as a Foreign Language (TOEFL) in
order to demonstrate selected aspects of their acquired proficiency in
English. TOEFL provides separate measures of Listening Comprehension,
Structure and Written Expression, and Reading Comprehension and Vocabulary
(RCV). A total score is also reported.

Many of these individuals take TOEFL more than one time; that is, they are
repeater examinees. Based on a study of the characteristics of TOEFL
examinees during 1977-79 (Wilson 1982), it is estimated that about one-third
of all examinees take TOEFL more than one time, either in regular
International and Special Center test administrations or in institutionally
controlled test administrations. The incidence of test repetition varied
considerably by country of origin, ranging from under 10 percent to over 50
percent. National contingents with higher percentages of repeaters tended to
be those with lower mean TOEFL scores: a correlation of -.64 was found
between these two summary statistics in data for 129 native-country groups.

Findings such as these point up the importance of the repeated test-taking
or repeater phenomenon. However, there have been no systematic analyses
either of the characteristics and test taking behavior of the repeating
examinees or of changes in TOEFL performance across repetitions.



Some Working Assumptions

It is assumed that the tendency to repeat TOEFL is inversely associated with
the initial score level attained by an examinee. First-time test takers in
International and Special Center (I&SC) test administrations, without regard
to country of origin, whose English proficiency as reflected in TOEFL scores
falls below some personal or external criterion level are likely to continue
to take TOEFL periodically until they either meet the criterion level or,
failing to do so, drop out of the TOEFL I&SC examinee population.

It is further assumed (a) that between testings many, if not most, TOEFL
repeaters attempt to improve their proficiency in English in a variety of
ways (through independent study, tutoring, intensive study of English as a
second language in a formal program, and so on), and (b) that, on the
average, some improvement in English proficiency is likely to occur among
repeating test takers between successive test administrations, improvement
that will be reflected in average increases in TOEFL scores for groups of
repeaters.

Some observed improvement in test performance might be due to factors other
than improved proficiency in the English language skills measured by TOEFL:
for example, increased familiarity with the specific test format, generally
increased "test-wiseness," statistical factors such as regression to the
mean, and so on. Factors such as these should be kept in mind in evaluating
observed changes in TOEFL scores for repeating I&SC examinees.



Study Objectives

This study was designed (a) to identify and evaluate the major patterns of
TOEFL-taking for cohorts of first-time examinees in International and
Special Center (I&SC) test administrations, (b) to document average change
in performance on TOEFL for the repeating candidates, and to study the
relationship of variables such as number of repetitions, time interval
between testings, and initial score levels to score change, (c) to study the
correlations between initial and subsequent testing, and (d) to assess the
contribution of nontest variables in combination with initial score to
prediction of final observed standing on the TOEFL.

The average score changes observed for repeaters under the range of
conditions that ordinarily prevail between test administrations for I&SC
examinees who repeat TOEFL may be thought of as providing base line
perspective on gain over time without regard to intervening variables for
individuals who are selected into the TOEFL I&SC repeater population.

In more specific terms, the study was designed to obtain answers to
questions such as the following:

(1) What are the test-taking patterns of cohorts of first-time I&SC test
takers from different native country and language groups? What is the
incidence of one-time, two-time, . . . , t-time test taking?

(2) What are the characteristic patterns of change in mean TOEFL performance
for two-time, three-time, . . . , t-time test takers from different native
country and language groups?

(3) What is the relationship between change in TOEFL score and such
variables as time interval between initial and last test administrations,
number of times tested, age, sex, test center (U.S. vs. other), educational
level (undergraduate vs. graduate), and so on?

(4) Which of these variables contribute most to prediction of last-time
scores after taking first-time TOEFL scores into account?

These questions pertain primarily to examinees tested in I&SC test
administrations. The study was also concerned with obtaining evidence
regarding short-term changes in TOEFL scores among repeaters tested by
institutions.

Data for the study were obtained from test files for 60 International and
Special Center test administrations between July 1977 and June 1982,
inclusive. These files were consolidated to form a "history file"; that is,
multiple test records for individuals were collated by using name, sex, and
date of birth matching criteria. For individuals with more than one test
record according to the matching criteria, records were ordered in terms of
date of test administration, from the earliest to the most recent.
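
The collation step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure actually used for the study: the record layout, the function names `build_history_file` and `times_tested`, the name normalization (trimming and upper-casing), and the sample records are all hypothetical. The report states only that name, sex, and date of birth served as matching criteria and that records were ordered by test date.

```python
from collections import defaultdict
from datetime import date


def build_history_file(records):
    """Collate test records per individual and order them by test date.

    Individuals are matched on (name, sex, date of birth), the three
    criteria named in the text. Upper-casing and trimming the name is an
    added assumption, not something the report describes.
    """
    histories = defaultdict(list)
    for rec in records:
        key = (rec["name"].strip().upper(), rec["sex"], rec["date_of_birth"])
        histories[key].append(rec)
    # Order each individual's records from earliest to most recent.
    for recs in histories.values():
        recs.sort(key=lambda r: r["test_date"])
    return dict(histories)


def times_tested(histories):
    """Number of test records accumulated by each matched individual."""
    return {key: len(recs) for key, recs in histories.items()}


# Hypothetical records, invented purely for illustration.
sample_records = [
    {"name": "Lee A", "sex": "F", "date_of_birth": date(1958, 3, 2),
     "test_date": date(1979, 5, 12), "total": 520},
    {"name": "LEE A ", "sex": "F", "date_of_birth": date(1958, 3, 2),
     "test_date": date(1977, 7, 9), "total": 470},
    {"name": "Lee A", "sex": "F", "date_of_birth": date(1958, 3, 2),
     "test_date": date(1978, 1, 14), "total": 495},
    {"name": "Cruz M", "sex": "M", "date_of_birth": date(1960, 11, 20),
     "test_date": date(1978, 3, 11), "total": 540},
]

histories = build_history_file(sample_records)
counts = times_tested(histories)
```

In this sketch an individual with a single record in `counts` corresponds to a one-time test taker, and anyone with two or more records to a repeater; the first and last records of each ordered list play the roles of the t-1 and t-T administrations.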

Following the organization of records to reflect the history of test taking
for each individual in the file, first-time test takers during the three
years between July 1977 and June 1980 were identified using responses to a
standard question on previous experience with TOEFL. Individuals tested
during one of the first 36 test administrations were classified as
first-time test takers if in registering for the initial test of record
during that period they reported no previous experience with TOEFL. Only the
records of first-time test takers who planned undergraduate or graduate
study in the U.S. or Canada (the primary TOEFL populations) were included in
the study.

The study file included all the test records of first-time test takers
during the period July 1977 through June 1980 that had accrued as of June
1982. Thus, the maximum time interval for a potential repeater varied
between two years, for first-time test takers in June 1980, and five years,
for those tested initially in July 1977.

Forming Analysis Groups

Given the heterogeneity of the TOEFL examinee population with respect to
national origin and linguistic background, and the known differences by
country in incidence of self-reported test repetition (Wilson 1982), it was
decided to select for analysis data for examinees from several countries or
groups of countries representing a range of average performance on TOEFL and
incidence of self-reported test repetition, and/or major language groups.
The analysis groups defined for the study were as follows:

01 Taiwan
02 Hong Kong
03 Korea
04 Thailand
05 Japan
06 India
07 Saudi Arabia, Kuwait, Lebanon, Iraq, Jordan, Syria
08 Iran
09 Chile, Colombia, Mexico, Peru, Venezuela
10 Germany, Netherlands, Denmark, Norway, Sweden
11 France, Italy, Spain, Portugal
12 Greece, Turkey
13 Ghana, Nigeria

Examinees from Greece and Turkey were grouped for analysis because of strong
similarities in level of performance on TOEFL, known incidence of test
repetition, and other examinee characteristics (see Wilson, 1982, Tables
[illegible], 5.1). Students from these two countries also tend to exhibit
similar patterns of performance on the Graduate Record Examinations General
Test (see Wilson, 1984, Table 13 and Figure 6.1). With the exception of
examinee contingents from India and Ghana-Nigeria (analysis groups 06 and
13, respectively), which are heterogeneous with respect to native language,
examinees from the selected native countries tend to be relatively
homogeneous linguistically (Wilson, 1982).



Supplementary Data on Institutionally Tested Repeaters

It is assumed that institutions participating in the TOEFL Institutional
Testing Program (INST) use TOEFL to identify individuals needing ESL
instruction and/or to assess their progress after a period of ESL
instruction. Thus, changes between initial and subsequent testings for
individuals identified as institutionally tested repeaters may be thought of
as reflecting changes "associated with" the typical range of ESL instruction
offered by the institutions involved over given time periods.

Using the same criteria for first-time test takers that were used with the
I&SC data, some 10,000 individuals were identified as having taken TOEFL for
the first time in an institutional test administration between July 1984 and
August 1985. About 10 percent of these first-time institutional test takers
repeated TOEFL at least once during the study period. Procedures employed in
analyzing data for institutionally tested repeaters and related findings are
described in detail in Section X.

Section III. Test-Taking Patterns of First-Time I&SC Examinees

The 13 analysis groups defined for the study included a total of 221,744
individual examinees who took TOEFL for the first time in an I&SC
administration between July 1977 and June 1980, inclusive. The distribution
of these examinees according to repeater versus nonrepeater status and
analysis group is shown in Table 1. As of June 1982, 61,355 examinees (27.7
percent) had repeated TOEFL one or more times and 160,409 (72.3 percent) had
not done so (were classified as one-time test takers, or nonrepeaters).
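
As a small worked illustration of how the incidence figures follow from the counts, the sketch below recomputes the repeater percentage for three analysis groups. The counts are as read from Table 1 (groups 02, 10, and 12); the helper name `repeater_incidence` is hypothetical and introduced only for this example.

```python
def repeater_incidence(n_first_time, n_repeaters):
    """Percentage of first-time test takers with at least one additional record."""
    return round(100.0 * n_repeaters / n_first_time, 1)


# (first-time test takers, repeaters) as read from Table 1.
table1_counts = {
    "02": (23278, 9687),   # Hong Kong
    "10": (6449, 242),     # Germany, Netherlands, Denmark, Norway, Sweden
    "12": (7461, 1604),    # Greece, Turkey
}

incidence = {
    group: repeater_incidence(n, repeaters)
    for group, (n, repeaters) in table1_counts.items()
}
```

The resulting values (41.6, 3.8, and 21.5 percent) agree with the percentages reported in Table 1 for these groups.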

o Among the 13 groups, incidence of repeated test taking varied from 3.8
percent to 50 percent.

o Five Asian contingents (Taiwan, Hong Kong, Korea, Thailand, and Japan)
registered the highest incidence of repeated test taking: between 40 and 50
percent had repeated at least once by June 1982. Moderate incidence of
repeating (between 19 and 24 percent) was found for four contingents (the
Arabic- and Farsi-speakers from the Mideast, Spanish-speakers from the
Americas, and the Greek-Turkish contingent). Low incidence of repeated test
taking (less than 10 percent) was found for groups 06, 10, 11, and 13 (the
Indian, the major European, and the principal African contingents).

More detailed data on test taking patterns for these analysis groups are
provided in Table 2 and Table 3 which show, respectively, the number and the
percentage of first-time examinees between July 1977 and June 1980 who had
accumulated designated numbers of test records as of June 1982. From the
data in Table 2 it may be determined that the 61,355 repeaters, who
accounted for only about 28 percent of all first-time test takers during the
study period, accounted for a total of over 180,000 test records, or about
53 percent of the total of over 340,000 test records. Only 160,000+, or
about 47 percent of the test records for the period were accounted for by
one-time test takers.
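
The arithmetic behind this comparison (share of examinees versus share of test records) can be sketched as follows. The function name `record_shares` and the small distribution used to exercise it are hypothetical, chosen only to show why repeaters contribute a larger share of records than of examinees; they are not the Table 2 figures.

```python
def record_shares(distribution):
    """Summarize a {times_tested: number_of_examinees} distribution.

    Returns the percentage of examinees who are repeaters (two or more
    records) and the percentage of all test records they account for.
    """
    total_people = sum(distribution.values())
    total_records = sum(k * n for k, n in distribution.items())
    repeater_people = sum(n for k, n in distribution.items() if k >= 2)
    repeater_records = sum(k * n for k, n in distribution.items() if k >= 2)
    return {
        "pct_examinees_repeating": 100.0 * repeater_people / total_people,
        "pct_records_from_repeaters": 100.0 * repeater_records / total_records,
    }


# Hypothetical distribution, invented for illustration only.
example = {1: 700, 2: 200, 3: 70, 4: 30}
shares = record_shares(example)
```

With these invented numbers, repeaters are 30 percent of examinees but supply roughly 51 percent of the records, the same kind of imbalance the text reports (about 28 percent of examinees, about 53 percent of records).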

Some examinees (0.1 percent) took TOEFL more than 10 times during the study
period but only 12.3 percent of all test takers had more than two records on
file (had repeated more than one time) by the end of the study period; that
is, were multiple repeaters.

As might be expected, incidence of multiple repetition was greatest for the
national-linguistic analysis groups with the highest general incidence of
repeaters, lowest for those with relatively few repeaters: more than 20
percent of the examinees in groups 01, 02, 03, 04, and 05 were multiple
repeaters as compared to fewer than 1.5 percent of those in groups 06, 10,
11, and 13. In most of the analyses of test performance in the study, data
for the small numbers of 11-, 12-, . . ., 20+-time test takers were not
included since Ns were too small to warrant analysis for these times-tested
subgroups.

Table 1

Distribution of First-Time Test Takers During the Period July 1977
through June 1980 According to Repeater versus Nonrepeater
Status as of June 1982

                        Status as of June 1982**
                     Repeater            Nonrepeater
Group*       N       No.     Percent     No.      Percent

01        36950     16100     43.6      20850      56.4
02        23278      9687     41.6      13591      58.4
03        10178      4324     42.5       5854      57.5
04         9480      3840     40.5       5640      59.5
05        14308      7150     50.0       7158      50.0
06        19158      1716      9.0      17442      91.0
07        20839      4277     20.5      16562      79.5
08        31605      7711     24.4      23894      75.6
09        15021      2909     19.4      12112      80.6
10         6449       242      3.8       6207      96.2
11         6131       510      8.3       5621      91.7
12         7461      1604     21.5       5857      78.5
13        20886      1265      6.2      19621      93.8

Total    221744     61355     27.7     160409      72.3
Median                        21.5                 79.5

* 01 (Taiwan), 02 (Hong Kong), 03 (Korea), 04 (Thailand), 05 (Japan),
06 (India), 07 (Saudi Arabia, Kuwait, Lebanon, Libya, Jordan, Syria,
Iraq), 08 (Iran), 09 (Mexico, Chile, Colombia, Peru, Venezuela), 10
(Germany, Netherlands, Denmark, Norway, Sweden), 11 (France, Italy,
Spain, Portugal), 12 (Greece, Turkey), 13 (Nigeria, Ghana).

** Repeaters are first-time test takers, July 1977 through June 1980,
who had at least one additional test record in the file as of June
1982; nonrepeaters had only one test record in the file.

Table 2



Distribution of First-Time Test Takers, July 1977 through June 1980, According to Total Number of Test Records
Accumulated as of June 1982, By Analysis Group

                                        Number of times tested
Group  No. test      1      2      3     4     5     6     7     8     9    10   11+
       takers

01     (36950)    20850   8499   3800  1715   895   500   285   182    97    54    73
02     (23278)    13591   4714   2119   [?]   631   [?]   269   190   151    55    75
03     (10178)     5854   2350    [?]   419   230   106    62    26    11     5     7
04     ( 9480)     5640   1943    801   460   280   126   [?]   [?]   [?]   [?]    17
05     (14308)     7158   3134   1710   910   545   318   187   114    96    43    93
06     (19158)    17442   1425    201    52    17   [?]     5     3     2     2     1
07     (20839)    16562   2535    992   383   192    88    41    23    11     3     9
08     (31605)    23894   4779   1642   704   352   126    57    22    17     5     7
09     (15021)    12112   1985    607   195    68    25    17     6     2     1     3
10     ( 6449)     6207    225     15     2
11     ( 6131)     5621    443     54    10     1     2
12     ( 7461)     5857   1084    312   109    48    36     9     5     1
13     (20886)    19621   1125    118    20     1     1

Total  (221744)  160409  34194  13542  6138  3260  1777   997   612   356   [?]   285



Table 3

Percentage Distribution of First-Time Test Takers, July 1977 through June 1980, According to Total Number
of Test Records Accumulated as of June 1982, By Analysis Group

                              Percent tested designated number of times
Group  No. test      1      2      3     4     5     6     7     8     9    10   11+
       takers

01     (36950)     56.4   23.0   10.3   4.6   2.4   1.4   0.8   0.5   0.3   0.1   0.2
02     (23278)     58.4   20.3    9.1   4.7   2.7   1.9   1.2   0.8   0.4   0.2   0.3
03     (10178)     57.5   22.6   10.7   4.1   2.3   1.0   0.6   0.3   0.1   [?]    *
04     ( 9480)     59.5   20.5    9.3   4.9   3.0   1.3   0.7   0.4   0.2    *    0.2
05     (14308)     50.0   21.9   12.0   6.4   3.8   2.2   1.3   0.8   0.7   0.3   0.6
06     (19158)     91.0    7.4    1.0   0.3   0.1    *     *     *     *     *     *
07     (20839)     79.5   12.2    4.8   1.8   0.9   0.4   0.2   0.1   0.1    *    [?]
08     (31605)     75.6   15.1    5.2   2.2   1.1   0.4   0.2   0.1   0.1    *     *
09     (15021)     80.6   13.2    4.0   1.3   0.5   0.2   0.1    *     *     *     *
10     ( 6449)     96.2    3.5    0.2    *
11     ( 6131)     91.7    7.2    0.9   0.2   0.2    *     *
12     ( 7461)     78.5   14.5    4.2   1.5   0.6   0.5   0.1    *     *
13     (20886)     93.9    5.4    0.6   0.1    *     *

Total  (221744)    72.3   15.4    6.1   2.8   1.5   0.8   0.4   0.3   0.2   0.1   0.1

Note. Analysis groups are 01 (Taiwan), 02 (Hong Kong), 03 (Korea), 04 (Thailand), 05 (Japan), 06 (India),
07 (Saudi Arabia, Kuwait, Lebanon, Libya, Jordan, Syria, Iraq), 08 (Iran), 09 (Venezuela, Mexico, Colombia,
Peru, Chile), 10 (Germany, Netherlands, Denmark, Norway, Sweden), 11 (France, Spain, Italy, Portugal), 12
(Greece, Turkey), 13 (Ghana, Nigeria).

* Less than 0.05 percent.



Section IV. Analysis of Mean Score Change, by Analysis Group

Table 4 shows the initial (time-1 or t-1) and the final (time-T or t-T)
TOEFL total means for repeaters, without regard to number of repetitions, by
analysis group. Average change (the difference between t-T and t-1 means) is
also shown. Groups are ordered from high to low in terms of incidence of
repeaters. The various analysis groups differed in initial and final level
of TOEFL performance, as well as mean change.

o The median change (net gain) in TOEFL total score means for the 13
analysis groups was 31.2; average gains ranged from 19.0 (Taiwan, 01) to
53.6 (08, Arabic-speakers from the Mideast).

o The four analysis groups with the lowest percentages of repeaters were
among the six with initial test means above the median for all groups; only
two of the six groups with above-median percentages of repeaters were above
the initial test score median. This is consistent with the assumption of an
inverse relationship between initial test score and the tendency to repeat,
but some lower-scoring contingents are not high in repeater rate.

o Five of the six groups with above-median average gain were among the six
groups having initial test means below the median for all groups.

The relationship between initial test score level and net gain is
highlighted in Table 5, which shows initial and final test means and the
corresponding differences for the 13 analysis groups, ordered from low to
high in terms of mean score on the initial test (t-1). In Table 5, the raw
score data from Table 4 have been subjected to a z-scale transformation with
respect to the distribution of most recent scores for all TOEFL examinees
tested between July 1977 and June 1979 who planned to study in the U.S. or
Canada (Wilson, 1982); that is, deviations from the reference group mean are
expressed in terms of reference group standard deviation units.
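
The z-scale transformation described above amounts to a one-line computation. The sketch below uses the reference-group mean (505) and standard deviation (73) given in the note to Table 5, and the group 08 means from Table 4 (434.8 at t-1, 488.4 at t-T); the function name `to_z` is a hypothetical helper introduced only for this example.

```python
REFERENCE_MEAN = 505.0  # 1977-1979 reference group mean (Table 5 note)
REFERENCE_SD = 73.0     # 1977-1979 reference group standard deviation


def to_z(scaled_mean, ref_mean=REFERENCE_MEAN, ref_sd=REFERENCE_SD):
    """Express a scaled-score mean as a deviation from the reference mean,
    in reference-group standard deviation units."""
    return (scaled_mean - ref_mean) / ref_sd


# Group 08 means as reported in Table 4.
z_t1 = to_z(434.8)   # initial (t-1) mean
z_tT = to_z(488.4)   # final (t-T) mean
```

Rounded to two decimals these come to -0.96 and -0.23, the values the Table 5 note gives for group 08.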

Adjusted mean gains in TOEFL total score amounted to one-half of a standard
deviation or more for the two Mideastern contingents (groups 07 and 08), the
Spanish speakers from the Americas (group 09), the Greek and Turkish
contingent (group 12), and the French, Italian, Spanish, and Portuguese
cluster (group 11). The smallest mean adjusted gain (0.24 z-scaled units)
was registered by the Taiwanese contingent (01), but the second
Chinese-speaking contingent (02, Hong Kong) showed an adjusted gain of 0.41
z-scale units. Adjusted gains for the remaining contingents ranged from 0.34
for the Korean contingent (03) to 0.43 for the Japanese repeaters (05).
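
The adjusted gains can be illustrated with a short sketch. The report says only that the t-1 mean "was regressed, using an estimated reliability of .90"; the specific formula below (multiplying the t-1 deviation by the reliability before differencing) is an assumption, adopted here because it reproduces the adjusted values reported for group 01. The function names `to_z` and `adjusted_gain` are hypothetical.

```python
REFERENCE_MEAN = 505.0  # reference group mean (Table 5 note)
REFERENCE_SD = 73.0     # reference group standard deviation (Table 5 note)


def to_z(scaled_mean):
    """Deviation from the reference mean in reference-group SD units."""
    return (scaled_mean - REFERENCE_MEAN) / REFERENCE_SD


def adjusted_gain(z_t1, z_tT, reliability=0.90):
    """Gain after regressing the initial mean toward the reference mean.

    Assumed form of the adjustment: the t-1 deviation is shrunk by the
    reliability estimate, then subtracted from the t-T deviation.
    """
    regressed_t1 = reliability * z_t1
    return z_tT - regressed_t1


# Group 01 (Taiwan) means as reported in Table 4.
z_t1 = to_z(489.9)
z_tT = to_z(508.9)

observed = z_tT - z_t1
adjusted = adjusted_gain(z_t1, z_tT)
```

For group 01 this gives an observed difference of about 0.26 and an adjusted difference of about 0.24 z-scale units. Because a below-average initial mean is pulled upward by the adjustment, the adjusted gain is smaller than the observed gain for groups starting below the reference mean and larger for groups starting above it.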

The data in Tables 4 and 5 point up the marked differences among analysis
groups with respect to both initial and final test means: 1.42
reference-group standard deviations separated the t-1 means for groups 10
and 08 (the highest and lowest scoring groups). With regard to the t-T
means, the range was less (1.15 z-scale units) but still substantial.
However, while 10 of the 13 groups of repeaters had t-1 means below the
general reference group mean, only three had t-T means below that level.

Table 4

Initial and Final TOEFL Total Means for Repeaters, Without Regard to
Number of Repetitions

               Repeaters**            Mean
Group*      No.#     Percent     t-1       t-T      Difference

05          7150      50.0      470.1     504.5       34.4
01         16100      43.6      489.9     508.9       19.0
03          4324      42.5      496.9     522.3       25.4
02          9687      41.6      490.6     521.8       31.2
04          3840      40.5      458.0     488.6       30.6
07          4277      20.5      439.7     484.0       44.3
08          7711      24.4      434.8     488.4       53.6
12          1604      21.5      473.4     515.4       42.0
09          2909      19.4      465.9     512.8       46.9
06          1716       9.0      508.0     533.2       25.2
11           510       8.3      523.2     559.2       36.0
13          1265       6.2      494.5     522.4       27.9
10           242       3.8      538.3     567.5       29.2

Median                21.5      489.9     515.4       31.2

* Groups, ordered in terms of incidence of repeaters, are 05 (Japan),
01 (Taiwan), 03 (Korea), 02 (Hong Kong), 04 (Thailand), 07 (Saudi
Arabia, Kuwait, Lebanon, Libya, Jordan, Syria, Iraq), 08 (Iran), 12
(Greece, Turkey), 09 (Mexico, Colombia, Chile, Peru, Venezuela), 06
(India), 11 (France, Spain, Italy, Portugal), 13 (Ghana, Nigeria), 10
(Germany, Netherlands, Denmark, Norway, Sweden).

** Only data for 2-, 3-, . . ., 10-time test takers were included in
calculating means.

# Ns include repeaters tested more than 10 times (that is, 11-, 12-, . . .,
T-time test takers, data for whom were not included in calculating the
means reported in this table).

Table 5



Initial and Final TOEFL Total Means for Analysis Groups, Expressed as
Deviations from the Most Recent Score Mean for a General Sample
of 1977-1979 Examinees, and Mean Differences With and Without
Adjustment of the Initial Means for Unreliability

                      Initial       Final       Difference between (t-1) and (t-T)
Group*       N       mean (t-1)   mean (t-T)     Observed      Adjusted**

08          7711       -0.96        -0.23          0.73          0.64
07          4277       -0.90        -0.29          0.61          0.52
04          3840       -0.64        -0.22          0.42          0.36
09          2909       -0.54         0.11          0.64          0.59
05          7150       -0.48         0.01          0.47          0.43
12          1604       -0.43         0.14          0.58          0.53
01         16100       -0.21         0.05          0.26          0.24
02          9687       -0.20         0.23          0.43          0.41
13          1265       -0.14         0.24          0.38          0.37
03          4324       -0.11         0.24          0.35          0.34
06          1716        0.04         0.39          0.34          0.35
11           510        0.25         0.74          0.49          0.52
10           242        0.46         0.86          0.40          0.41

Median                 -0.21         0.14          0.43          0.41

Note. In this table, the scaled score means shown in Table 4 have been
expressed as deviations from the most recent score mean of all TOEFL
examinees tested between July 1977 and June 1979, in standard deviation
units. As reported elsewhere (Wilson, 1982), the mean for this reference
group was 505 and the standard deviation was 73. For example, the t-1 total
score mean for Group 08 (434.8, from Table 4) was 70.2 scaled-score points,
or 0.96 standard deviation units (70.2 / 73.0), lower than the 1977-1979
reference group mean of 505; the t-T mean (488.4) was 16.6 scaled score
points (0.23 standard deviation units) below the reference group mean; and
so on.

* Groups, ordered from low to high in terms of mean score on the initial
test, are: 08 (Saudi Arabia, Kuwait, Jordan, Lebanon, Libya, Iraq, Syria),
07 (Iran), 04 (Thailand), 09 (Chile, Colombia, Mexico, Peru, Venezuela), 05
(Japan), 12 (Greece, Turkey), 01 (Taiwan), 02 (Hong Kong), 13 (Ghana,
Nigeria), 03 (Korea), 06 (India), 11 (France, Italy, Spain, Portugal), 10
(Germany, Netherlands, Denmark, Norway, Sweden).

** In calculating the adjusted differences, the t-1 mean was regressed,
using an estimated reliability of .90. Reliability estimates are not
available for first-time test takers who repeat. Thus, differences involving
the adjusted time-1 means should be thought of primarily as illustrative of
"regression effects."

Section V. Mean Score Change by Number of Times Tested

The data in Tables 4 and 5 reflect average differences between the t-1 and
t-T TOEFL total scores for repeaters without regard to number of test
repetitions. Table 6 shows the t-1 and t-T TOEFL total score means for 2-,
3-, . . ., 10-time test takers in the respective analysis groups. On
balance, the overall picture is one of added gains with added testing.

The t-1 means of multiple repeaters varied inversely with number of times
tested. Examinees with only two test records had higher t-1 means than those
with three, those with three test records had higher t-1 means than those
with four, and so on. While there were some minor reversals in this pattern,
it was the prevailing one.

Times-tested subgroups did not differ very much with respect to level of t-T
performance. For subgroups with t-1 means below 500, t-T means approaching
or slightly higher than 500 were typical. In groups 02, 03, 04, and 05,
examinees tested five to 10 times had t-T means comparable to those of
two-time test takers. In groups 07 and 08, five- and seven-time test takers
had higher t-T means than those tested only two times. For the four groups
with very low incidence of repetition (groups 06, 10, 11, and 13), for the
high-incidence Taiwanese contingent (group 01), and for groups 09 and 12,
there was a modest decline in t-T means across times-tested subgroups.

Table 7 shows differences between t-1 and t-T means for examinees tested
designated numbers of times. Groups are clustered by overall incidence of
test repetition, from high to low.

As expected from the data in Table 6, mean gain tended to increase with
additional testing: individuals taking TOEFL three times gained more on the
average than those taking it only twice, four-time test takers gained more
than three-time test takers, and so on.
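
The relationship between the two tables is simple subtraction, as the following sketch shows. It uses the group 08 means for two- through five-time test takers as read from Table 6; the variable names are hypothetical and chosen only for this illustration.

```python
# Group 08 means as read from Table 6: times tested -> (t-1 mean, t-T mean).
group_08_means = {
    2: (442.1, 483.7),
    3: (428.0, 492.4),
    4: (420.7, 498.7),
    5: (414.0, 502.3),
}

# Mean gain for each times-tested subgroup is the t-T mean minus the t-1 mean.
gains = {
    times: round(t_last - t_first, 1)
    for times, (t_first, t_last) in group_08_means.items()
}

# Check that gain rises with each additional testing for this group.
ordered = [gains[k] for k in sorted(gains)]
gain_increases = all(a < b for a, b in zip(ordered, ordered[1:]))
```

For this group the gains work out to 41.6, 64.4, 78.0, and 88.3 scaled-score points. The increase comes from both sides of the subtraction: t-1 means fall as the number of testings rises, while t-T means rise slightly.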

At all levels of repeated test taking, repeaters in analysis groups with
high incidence of repetition (01, 02, 03, 04, and 05) showed lower mean gain
than repeaters in the four analysis groups with moderate incidence of test
repetition (07, 08, 09, and 12). Comparing only two-time test takers, the
largest repeater subgroup, gains for the high-incidence analysis groups were
lowest (ranging between 12.4 and 19.8), those for the moderate-incidence
groups were highest (ranging between 31.4 and 41.6), and gains for the
low-incidence groups were in between (ranging from 22.5 to 32.7).

Similarities and differences among the 13 analysis groups in patterns of
mean gain for individuals tested 2, 3, . . ., 10 times, as reported in Table
7, are pointed up graphically in Figure 1. Several clusters of analysis
groups based on general similarity in gain patterns are discernible: (a)
groups 07, 08, 09, and 12, (b) the contingents from Thailand, Hong Kong, and
Japan (groups 04, 02, and 05), (c) the Taiwanese and Korean contingents
(groups 01 and 03), although these may be thought of as constituting two
distinctive patterns rather than a cluster, (d) the Germanic and Romance
language (European) contingents (groups 10 and 11), and (e) the contingents

Table 6



First and Last Administration Mean TOEFL Total Scores for Repeaters,
by Number of Times Tested and Analysis Group

                                   Number of times tested
Group/                                                                                 All
admin.      2       3       4       5        6        7        8        9       10     individ.

01  t-1    [?]    479.0   473.4   465.2    456.0    446.6    450.6    435.1    441.2    489.9
    t-T    [?]    501.0   500.1   495.2    492.5    487.0    492.5    479.6    482.9    508.9

02  t-1   502.3   489.2   478.2   472.1    468.4    466.1    460.3    458.8    451.6    490.6
    t-T    [?]    521.5   520.2   526.0    526.6    524.4    529.6    529.6    521.3    521.8

03  t-1   504.4   496.8   483.9   480.4    468.6    463.4    450.2a   440.8b   455.8c   496.9
    t-T   522.8   525.2   518.8   520.9    513.9    518.2    498.0    496.8    496.0    522.3

04  t-1    [?]    457.7   448.9   444.2    437.8    423.0    434.3a   413.4b   426.5c   458.0
    t-T   485.3   491.4   492.5   493.6    492.3    487.3    498.2    500.2    481.0    488.6

05  t-1   483.3   469.7   460.4   453.3    445.1    440.2    433.8    430.6    432.1a   470.1
    t-T   502.8   504.8   508.6   506.2    504.2    509.7    502.6    499.1    509.0    504.5

06  t-1   511.2   474.4   463.8    [?]b    411.4c   463.4c     *        *        *      508.0
    t-T   537.7    [?]    504.5   533.4    524.2    494.8      *        *        *      533.2

07  t-1   449.3   431.8   421.1   413.3    411.0    421.6a   401.6b   422.6b     *      439.7
    t-T   481.1   484.8   482.8   491.6    491.6    490.6    486.4    520.4      *      484.0

08  t-1   442.1   428.0   420.7   414.0    413.3    397.6    412.1b   394.1b   399.6c   434.8
    t-T   483.7   492.4   498.7   502.3    502.7    509.9    501.4    493.5    499.4    488.4

09  t-1   477.1   447.7   433.9   426.6    415.2a   403.1b   392.3c     *        *      465.9
    t-T    [?]     [?]    506.2   508.4    496.1    497.4    506.2      *        *      512.8

10  t-1   540.1    [?]b     *                                                           538.3
    t-T    [?]    571.0     *                                                           567.5

11  t-1   527.4    [?]     [?]b     *        *                                          523.2
    t-T   560.1   555.8   533.1     *        *                                          559.2

12  t-1   486.4   457.3   437.3   433.5a   414.4a   384.1c   420.6c     *               473.4
    t-T   517.8    [?]    509.0   509.0    509.4    488.6    508.1      *               515.4

13  t-1    [?]    475.2   475.1b    *        *                                          494.5
    t-T   524.0    [?]    512.8     *        *                                          522.4

Note. t-1 is the mean time-1 test score; t-T is the mean score on the last
test of record. Analysis groups are 01 (Taiwan), 02 (Hong Kong), 03 (Korea),
04 (Thailand), 05 (Japan), 06 (India), 07 (Saudi Arabia, Kuwait, Lebanon,
Libya, Jordan, Syria, Iraq), 08 (Iran), 09 (Mexico, Chile, Colombia, Peru,
Venezuela), 10 (Germany, Netherlands, Norway, Denmark, Sweden), 11 (France,
Italy, Spain, Portugal), 12 (Greece, Turkey), 13 (Ghana, Nigeria).

* Less than 5 cases; a (N = 25-49); b (N = 10-24); c (N = 5-9). See Table 2
for Ns, all categories.



Table 7

Difference between Initial and Final TOEFL Total Means,
by Number of Times Tested and Analysis Group

                         Number of times tested
Group*     2      3      4      5      6      7       8       9      10

01       12.4   22.0   26.7   30.0   36.5   40.4    41.9    43.8    41.7
02       18.3   32.3   42.0   53.9   58.2   58.3    69.1    70.8    69.7
03       18.4   28.6   34.9   40.5   45.3   49.8    47.8a   56.0b   40.2c
04       19.8   33.7   43.6   49.4   54.5   59.3    63.4a   84.8b   54.5c
05       19.5   35.1   48.2   52.9   59.1   68.5    68.8    68.5    76.9a

07       33.8   53.0   63.3   69.5   80.6   69.0a   84.8b   97.8b
08       41.6   64.4   78.0   88.3   89.4  112.3    89.3b   99.4b   99.8c
09       38.5   58.7   72.3   81.8   80.9a  94.3b  113.9c
12       31.4   54.3   71.7   75.5   95.0a 104.5c   88.0c

06       22.5   35.0   40.7   52.8b  72.8c
10       27.2   51.9b
11       32.7   56.5   60.[?]a
13       27.1   33.0   37.7b

* 01 (Taiwan), 02 (Hong Kong), 03 (Korea), 04 (Thailand), 05 (Japan),
07 (Saudi Arabia, Jordan, Lebanon, Libya, Iraq, Syria), 08 (Iran),
09 (Chile, Colombia, Mexico, Peru, Venezuela), 12 (Greece, Turkey),
06 (India), 10 (Germany, Netherlands, Denmark, Norway, Sweden),
11 (France, Italy, Spain, Portugal), 13 (Nigeria, Ghana).

a N = 25-49
b N = 10-24
c N = 5-9

Figure 1. Plot of differences between t-1 and t-T TOEFL total means for
repeaters in the respective analysis groups, by number of times tested

from India and Ghana/Nigeria (groups 06 and 13).



Contrast in patterns for TOEFL and GRE repeaters. A detailed picture
of the patterns of t-1 and t-T means for 2, 3, . . . , 10-time test takers in
the respective analysis groups is provided in Figure 2. Broken lines
connecting t-1 and t-T means are used to indicate breaks in the typical
inverse relationship between t-1 mean and number of times tested. For
example, in the array for Taiwan, 8-time test takers had slightly higher t-1
means than did 7-time test takers, the t-1 mean for 10-time test takers was
somewhat higher than that for 9-time test takers, and so on. The horizontal
line for each array indicates the mean TOEFL total score of all first-time
test takers (nonrepeaters plus repeaters) in the analysis group.

A plot of t-1 and t-T Verbal test means for 1, 2, . . . 5-time GRE
test takers, adapted from those reported by Rock and Werts (1979), is
included for comparative purposes (page 1 of figure, lower right frame).

For TOEFL repeaters, a strong inverse relationship between t-1 mean and
number of times tested is clearly evident in Figure 2. The general
comparability of the t-T means of 2, 3, . . . , T-time test takers is also
evident. Among the five high-incidence and the four moderate-incidence
contingents, only the time-T means of Taiwanese repeaters show a moderately
declining gradient as number-of-times-tested increases. For the contingents
with low general incidence of test repetition, and few multiple repeaters,
trends are mixed.

Differences in patterns of mean score change on the GRE Verbal test for
GRE repeaters (largely native English speakers) and mean score change for
TOEFL repeaters (nonnative English speakers) are quite clear in Figure 2. For
GRE repeaters, the mean increase in verbal score tends to decline with number
of times tested. Thus, the array of t-T means for 2-, 3-, and 5-time
test takers has a sharply declining gradient. The most-recent test mean for
3-time test takers was comparable to the t-1 mean for 2-time test takers, the
most recent mean of 4-time test takers is like the initial mean of 3-time test
takers, and so on. In addition, none of the GRE times-tested repeater groups
had most-recent score means that reached the population value and only the
most-recent score mean for 2-time test takers approached the population
value.

The patterns of score change for TOEFL multiple repeaters, especially
those in the contingents with high and moderate incidence of repetition,
contrast sharply with the observed patterns of score change on the Verbal
test for GRE repeaters. For TOEFL repeaters, (a) level of performance on the
most recent test administration (t-T) for 2, 3, . . . , T-time test takers
appears to be independent of their average levels of t-1 performance, and (b)
t-T means tend to equal or surpass the mean for all first-time test takers in
the respective groups and the most-recent score mean for all TOEFL examinees.

In the low-incidence contingents (groups 06, 10, 11, and 13), some
tendency for the t-T means of multiple repeaters to fall below the mean for
all first-time test takers is evident in Figure 2. For three of those
contingents (06, 10, and 11), the mean for all first-time examinees was high
(over 550), and repeaters in the fourth low-incidence group (13), and in group
06, may have had substantial exposure to academic instruction in English.
Thus, at time of initial TOEFL-taking, examinees in the low-incidence
contingents may have reached a more advanced level of maturation with respect
to the development of English proficiency than examinees in the high-incidence
contingents.

Figure 2. Plot of initial and final TOEFL total means for 2-, 3-, . . .,
10-time test takers, by analysis group, compared with plots of initial and
most recent GRE verbal test means for 2-, 3-, 4-, 5-time test takers



Differences in patterns of multiple test taking. There were relatively
sharp differences, by analysis group, in the behavior of repeater subgroups
with comparable t-1 means. More specifically, there were differences in the
persistence (reflected in number of times tested) of repeaters across analysis
groups, relative to average score gains and initial score levels. Figure 3
shows, illustratively, the means for successive test administrations for 2-,
3-, . . ., T-time test takers in four analysis groups, two with high and two
with moderate overall incidence of test repetition.

In each of the four analysis groups, a repeater subgroup with a first-
time mean of approximately 450 (a "t-1 450" subgroup) may be identified. In
contingent 07, the "t-1 450" subgroup took only one additional test and earned
a t-2 mean approaching 485; in contingent 09 this subgroup repeated TOEFL and
earned a t-3 mean exceeding 500; and in contingents 01 and 02 the
"t-1 450" subgroups were made up of 8-time and 10-time test takers, with t-8
and t-10 means of approximately 490 and 520.

It seems reasonable to believe that the "t-1 450" subgroup in group 07,
which averaged around 485 on the final testing, might have attained a mean
approaching 500 on a third administration of TOEFL, after additional effort.
However, they did not persist. On the other hand, the "t-1 450" repeater
subgroup in the Hong Kong contingent, which averaged approximately 485 at
t-3, persisted through 10 test administrations. The reasons for such
differences in test-taking behavior are not evident. None of the times-tested
subgroups in analysis group 07 (Arabic Mideast) persisted through attainment
of t-T means of 500 despite mean gain patterns suggesting the feasibility of
attaining this goal with moderate additional effort. At the same time,
repeaters with comparable t-1 means in other analysis groups persisted through
eight to 10 testings and earned t-T means approaching or exceeding 500.

Such a pattern may reflect differences in perceived accomplishment: the
final performance of 2-, 3-, . . ., T-time test takers in analysis group 07
was quite high relative to the mean for all first-time test takers and for
that reason may have been perceived as sufficient, while in analysis groups 01
and 02, with considerably higher time-1 means, the score level associated with
comparable gains may have been perceived as insufficient. Additional study is
needed to shed light on the dynamics underlying the differences in test-taking
patterns that are characteristic of the various contingents of examinees.

These findings support the proposition that in a population of U.S.-bound
foreign examinees for whom English is a second language, TOEFL is measuring
developing English language verbal skills, and that in the largely native
English-speaking GRE population, the verbal abilities measured by the GRE
verbal measure are more fully developed, thus less amenable to change.



Figure 3. Means on successive test administrations of 2-, 3-, . . ., T-time
test takers in four analysis groups (01 Taiwan, 02 Hong Kong, 07 Arabic
Mideast, and 09), with the "t-1 450" subgroup marked in each

Section VI. Change in TOEFL Section Scores

Tables 8 and 9 summarize data on the t-1 and t-T section score means, and
the corresponding differences in means, by analysis group. In Table 9, data
are presented for analysis groups ordered from high to low in terms of mean
change on TOEFL total; ranks of the groups on the section-score mean change
values are also given.

For eight of the analysis groups, the lowest means were obtained on
Structure and Written Expression; for only one group was this mean the
highest. For six groups, mean Listening Comprehension scores were highest;
means on Reading Comprehension and Vocabulary (RC&V) were highest for six
additional groups. Of the six groups with the highest mean total score gain,
five were among the six highest gainers on SWE and RC&V, and all had LC gain
ranks between 1 and 6, out of 13.

Across analysis groups, ranks with respect to mean RC&V gain corresponded
a bit more closely to ranks on total gain (rho = .962) than did ranks on LC
(rho = .924) or SWE (rho = .902).
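
The rank-order (rho) coefficients cited above can be illustrated with a short sketch. It assumes the usual convention of averaging tied ranks and then correlating the ranks; the report does not say how ties were handled, and the six gain values below are hypothetical, not taken from Table 9.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from largest (rank 1) to smallest, averaging tied ranks."""
    order = sorted(range(len(values)), key=lambda i: -values[i])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the run while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        shared = (pos + end) / 2 + 1  # mean of the 1-based positions in the run
        for k in range(pos, end + 1):
            ranks[order[k]] = shared
        pos = end + 1
    return ranks


def spearman_rho(x, y):
    """Pearson correlation computed on the (tie-averaged) ranks of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / sqrt(var_x * var_y)


# Hypothetical mean gains for six groups (illustration only).
total_gain = [50.0, 45.0, 40.0, 35.0, 30.0, 20.0]
section_gain = [5.5, 4.4, 4.4, 3.0, 3.5, 2.0]
rho = spearman_rho(total_gain, section_gain)
```

With 13 groups, as in Table 9, a single pair of adjacent groups trading places changes rho only slightly, which is why coefficients as close as .962 and .924 indicate nearly the same ordering.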

Figure 4 shows comparative profiles of t-1 and t-T section score means
for repeaters, and comparable profiles of means for nonrepeaters, in each
analysis group. In five analysis groups (02, 04, 05, 07, and 08), the final
(t-T) means of repeaters (broken lines with dots) exceeded those for
nonrepeaters (solid lines with x's) on all three sections. In four groups (06,
09, 10, and 11) the opposite was true, and in the remaining four groups (01,
03, 12, and 13) the t-T means of repeaters were lower than those of
nonrepeaters on at least one of the sections.

Except for the Korean repeaters, the t-1 and t-T profiles of
repeaters (solid lines with dots) and the profiles of nonrepeaters were
roughly parallel. Nonrepeaters in group 03 (Korea) had an atypically high mean
on Listening Comprehension, and the t-T LC mean of repeaters was slightly
lower than that of nonrepeaters, although on both SWE and RC&V, the repeater
t-T means surpassed the means of nonrepeaters.

Similarities and differences among the analysis groups with respect to
relative performance on the respective TOEFL sections are discernible at both
t-1 and t-T, reflecting similarities and differences in the relative
development of the corresponding English language skills. Repeaters from
Taiwan, Hong Kong, Thailand, and Japan (groups 01, 02, 04, and 05), for
example, had generally similar profiles. Profiles for Korean and Indian
repeaters (groups 03 and 06) differ from the others, being characterized by
relatively low means on LC, with progressively higher means on SWE and RC&V.
Unique among the analysis groups, the profiles of repeaters from Ghana and
Nigeria (group 13) exhibited considerably higher SWE than LC means, and
somewhat higher than RC&V means.

Profiles for the two Mideastern analysis groups (07 and 08) are generally
similar, characterized by substantially higher LC than SWE means, and somewhat
higher SWE than RC&V means. Similar statements might be made about the other
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01 
03 
03 
04 
05 
06 
07 
Oi 
09 
10 
II 
13 
13 

Madiaa 

Note: Analysis groups are 01 (Taiwan), 02 (Hong Kong), 03 (Korea), 04
(Thailand), 05 (Japan), 06 (India), 07 (Saudi Arabia, Kuwait, Jordan, Lebanon,
Libya, Syria, Iraq), 08 (Iran), 09 (Venezuela, Mexico, Colombia, Peru, Chile),
10 (Germany, Netherlands, Denmark, Norway, Sweden), 11 (France, Spain, Italy,
Portugal), 12 (Greece, Turkey), 13 (Ghana, Nigeria).

* LC (Listening Comprehension); SWE (Structure and Written Expression); RC&V
(Reading Comprehension and Vocabulary).





svt* 


IGAV* 


fas 


4i.l 


4S.1 


49.7 


489.9 


49.2 


41. 3 


49.8 


410.6 


47.7 


4f .4 


51.0 


496.9 


47.3 


44*9 


43»4 


458.0 


47«9 


46. i 


46.5 


470.1 


4a. a 


50.6 


33*6 


50i.O 


47,7 


43.4 


41.8 


439.7 


47.6 


43.0 


40.8 


434,8 


4B.i 


43«1 


48.3 


4i5«9 


57.6 


51.3 


31.4 


338«3 


51«3 


30.4 


34«A 


333.3 


50.1 


45.9 


46.0 


473.4 


4i.i 


51,3 


31«1 


494.3 


4i«6 


48*1 


4f.7 


489«9 



LO 


itfl 


ICAV 


foe 


30.9 


49.9 


51.1 


308.9 


31. i 


31.3 


52.6 


311.6 


30.1 


53.3 


34.3 


533.3 


50.3 


47.9 


4b»4 


488,6 


51.5 


49.9 


30*0 


504.3 


51.9 


33.1 


54.9 


333,3 


51.1 


4^.0 


41,1 


484*0 


53.1 


47il 


41.3 


488,4 


33.0 


41.7 


51.3 


513.8 


60.0 


35.1 


55.3 


567,3 


36.1 


34.4 


5ni 


559.1 


54.0 


30.3 


30.0 


513.4 


49.6 


34.1 


33.1 


333*4 


53. r 


3G.3 


51*3 


313*4 



Table 9

Mean Change, First to Last Test Administration, in TOEFL Section
Scores by Analysis Group



■ OEsUp 

OS 

09 
07 
11 

11 

03 
03 
04 

10 

13 
03 
06 

01 



7704 

3906 
4368 
li64 

310 
7037 
9611 
3833 

243 
1365 
4317 
17IS 

16017 



LO* laak 

C5.6) 1 

C4*4> 3.3 
C4*4) 3.5 
(3.9) 3 

4.0 4 
(3*6) 6 

3.4 7.3 

3.1 9.5 

(3*4} 13 
3.4 7.5 

2*5 11 

3*1 f*3 

1«S 13 



Haaa Ehanga 
iWB* Eask 
3,1 3 



5.6 
4.0 

13 

3.1 

77U_ 



1 

3.3 
3.3 

3 
7 
8 
9 



1Q4V* iagk 

4.0 3.5 

4_. 3 1 

T7q 3.3 



{1.7} 

3.3 
{3,8} 
(3*0) 



3.3 6 

{2.9} 10 

2.S li 

1*6 12 

Ij^S 13 



2,8 a 

3.6 12 

{2.3} 10 

{1.9} 13 

(2*1} 11 



faeal 

33.6 

46.9 
44.3 
43*0 

36.0 
34*4 
31.3 
30*6 

39.2 
27.9 
25.6 
25.2 

19*0 



Note: Groups are ordered by total score gain, and are 08 (Iran), 09 (Chile,
Colombia, Mexico, Peru, Venezuela), 07 (Saudi Arabia, Kuwait, Jordan, Lebanon,
Libya, Syria, Iraq), 12 (Greece, Turkey), 11 (France, Italy, Spain, Portugal),
05 (Japan), 02 (Hong Kong), 04 (Thailand), 10 (Germany, Netherlands, Denmark,
Sweden, Norway), 13 (Nigeria, Ghana), 03 (Korea), 06 (India), 01 (Taiwan).

* LC (Listening Comprehension); SWE (Structure and Written Expression); RC&V
(Reading Comprehension and Vocabulary). Underscoring indicates the section on
which the repeaters in each analysis group had the lowest t-1 mean; the
section with the highest mean is indicated by parentheses.

Figure 4. First-time (t-1) and final (t-T) TOEFL section-score means for
repeaters relative to section-score means for nonrepeaters, by analysis group

four groups. Perhaps the most pertinent generalization is that the development
of the components of English proficiency represented by the TOEFL sections
seems to remain relatively consistent across test repetitions. For each
analysis group there are clear similarities in the patterning of t-1 and t-T
section-score mean profiles of repeaters; with the exception of the Korean
contingent, the profiles of repeaters and nonrepeaters for the respective
analysis groups are generally similar.

Section VII. Selected Correlates of Change in Total Score

Preceding sections have been concerned primarily with patterns of
TOEFL-taking and average change in TOEFL scores for the analysis groups. This
section discusses (a) selected variables as correlates of change in TOEFL
scores, and (b) the relative contribution of those variables to prediction of
final (t-T) total score. Unless otherwise noted, all the analyses reported are
based on a 20 percent random sample of examinees (repeater N = 11,612).



The variables selected for study were as follows:

Variable            Definition

Change or           Difference between the last score of
Difference (D)      record (time T or t-T) and the initial
                    score (time 1 or t-1)

  LC-D              Listening Comprehension change
  SWE-D             Structure & Written Expression change
  RC&V-D            Reading Comprehension & Vocabulary change
  Total-D           TOEFL total change

LC-1, SWE-1,        First-time (t-1) score on variable
RC&V-1, Total-1

LC-T, SWE-T,        Last score of record (t-T) for variable
RC&V-T, Total-T

Intrvl              Interval in months between t-1 and t-T
Numtsts             Number of times tested (2, 3, . . . 10)
Edlevl              At t-1, graduate = 1, undergraduate = 0
Sex                 Male = 1, Female = 2
Center-U.S.         At t-T, U.S. center = 1; other = 0
Reports             At t-1, designated institution to receive
                    score report = 1; did not do so = 0
Age                 At t-1, age in years



Table 10 shows the means of the 13 analysis groups, and of the total 20
percent sample of repeaters, on the non-test variables. The following
general picture of TOEFL repeaters and their behavior emerges:
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Table 10 



Means on Selected Variables for Repeaters, by Analysis Group

                                      Variables*
Group        N    Intrvl  Numtsts  Edlevl    Sex   Center  Reports    Age

01         3070    11.6     2.9     0.83    1.41    0.08    0.34     25.1
02         1781    13.7     3.2     0.26    1.34    0.06    0.24     19.3
03          819    11.7     2.8     0.76    1.26    0.26    0.38     25.3
04          733    12.9     3.0     0.85    1.44    0.32    0.34     23.8
05         1346    11.3     3.2     0.50    1.34    0.40    0.41     23.8

07          814    12.6     2.7     0.34    1.07    0.65    0.35     21.8
08         1500     [?]     2.7     0.31    1.21    0.79    0.33     21.3
09          566    10.1     2.5     0.60    1.26    0.69    0.46     23.9
12          307    10.2     2.6     0.56    1.16    0.34    0.45     21.4

06          313    15.5     2.3     0.82    1.08    0.18    0.72     23.3
10           48    11.5     2.1     0.46    1.27    0.15    0.46     22.5
11          101    10.6     2.2     0.78    1.24    0.22    0.57     24.0
13          214    15.9     2.1     0.06    1.09    0.22    0.57     22.2

Total     11612    11.9     2.9     0.57    1.30    0.32    0.36     23.0

Note: Groups are 01 (Taiwan), 02 (Hong Kong), 03 (Korea), 04 (Thailand),
05 (Japan), 07 (Saudi Arabia, Kuwait, Jordan, Lebanon, Iraq, Syria), 08
(Iran), 09 (Chile, Colombia, Mexico, Peru, Venezuela), 12 (Greece, Turkey),
06 (India), 10 (Germany, Netherlands, Denmark, Norway, Sweden), 11 (France,
Italy, Spain, Portugal), 13 (Ghana, Nigeria). Analyses are based on a 20
percent random sample of all repeaters.

* Interval in months between t-1 and t-T; number of times tested;
educational level (graduate = 1, undergraduate = 0); sex (M = 1, F = 2);
center (last test in U.S. = 1, not in U.S. = 0); reports (1 = designated
institution to receive t-1 report, 0 = did not do so); age in years at time
of initial testing (t-1).

o On the average, these repeaters were tested (in International and
Special Center administrations) approximately three times (mean Numtsts =
2.9); the average interval in months between the first test administration and
the last of record was about one year (11.9 months).

o Some 30 percent of the repeaters were females (mean sex = 1.30);
average age at t-1 was 23 years. More than one half of the repeaters (57
percent) planned graduate study in the United States or Canada at time of
initial testing (Edlevl = 0.57), but only slightly more than one-third
designated institutions to receive their t-1 TOEFL score report (mean reports
= 0.36).

o Approximately one third (32 percent) were in the United States when
they took the last test of record (mean Center-U.S. = 0.32). Data not tabled
indicate that about six in 10 of this subgroup of examinees took both t-1 and
t-T in the United States.

o There were differences among the 13 analysis groups with respect to
each of the nontest variables.

Means, standard deviations, and intercorrelations of the test and non-
test variables in the 20 percent sample of repeaters, undifferentiated with
respect to analysis group, are shown in Table 11. The upper portion of the
table shows intercorrelations of the 12 test variables; the middle portion
(rows 13 through 19) shows the simple correlation of nontest variables with the
12 test variables; and the bottom portion shows the intercorrelations of the
seven nontest variables.

The coefficients in row 4 indicate, for example, the correlation of
change between t-1 and t-T in total score (TOT-D) with each of the other
test variables. The coefficients in row 13, for example, indicate that time
interval between t-1 and t-T was positively related to change in TOEFL section
and total scores, negligibly related to t-1 scores, and positively related to
final (T) scores. It may also be determined from the middle rows that change
in TOEFL total (TOT-D) was negatively related to Age (r = -.13), Edlevl (r =
-.12), and Sex (r = -.08), indicating somewhat greater average net gains for
younger repeaters, undergraduates, and males; that repeaters who designated
institutions to receive their initial score reports had slightly lower average
gain (r = -.07) than their counterparts who did not have their reports
sent to institutions; and so on.

In evaluating the correlations between the initial scores on TOEFL and
the difference or change between t-1 and t-T, it is important to keep in mind
that these coefficients do not reflect the relationship between "true initial
standing" and "true gain," since TOEFL is not a perfectly reliable measure of
its underlying construct. Lord (1967, pp. 33-34) has provided formulas for
estimating the relationship between true initial standing and true gain. For
data of this study, assuming a reliability of .9 for both t-1 and t-T, the
estimated correlation for true t-1 total score standing and total score gain
is -.39, very close to the observed coefficient (r = -.43). The similarity
between these two coefficients supports the construct validity of TOEFL.
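
The comparison above can be approximated with a short sketch. It assumes the standard classical-test-theory setup (observed score = true score + error, with errors uncorrelated with true scores and with each other); whether this matches Lord's published formulas term for term is an assumption. The inputs are the t-1/t-T total-score correlation (.72) and the total-score standard deviations (56.5 and 53.8) as read from Table 11, together with the assumed reliability of .9.

```python
from math import sqrt


def observed_initial_gain_r(r_xy, sd_x, sd_y):
    """Correlation between observed initial score X and observed gain Y - X."""
    cov_xy = r_xy * sd_x * sd_y
    var_gain = sd_x ** 2 + sd_y ** 2 - 2.0 * cov_xy
    return (cov_xy - sd_x ** 2) / (sd_x * sqrt(var_gain))


def true_initial_gain_r(r_xy, sd_x, sd_y, rel_x, rel_y):
    """Estimated correlation between true initial standing and true gain.

    Under the stated assumptions, true-score variance is reliability times
    observed variance, and the covariance of the two true scores equals the
    covariance of the two observed scores.
    """
    cov_xy = r_xy * sd_x * sd_y
    var_true_x = rel_x * sd_x ** 2
    var_true_y = rel_y * sd_y ** 2
    var_true_gain = var_true_x + var_true_y - 2.0 * cov_xy
    return (cov_xy - var_true_x) / sqrt(var_true_x * var_true_gain)


# Inputs as read from Table 11 (TOEFL total, t-1 and t-T).
R_XY, SD_T1, SD_TT = 0.72, 56.5, 53.8
observed = observed_initial_gain_r(R_XY, SD_T1, SD_TT)
estimated_true = true_initial_gain_r(R_XY, SD_T1, SD_TT, 0.9, 0.9)
```

With these rounded inputs the observed coefficient comes out at about -.43 and the estimate for true scores at about -.38, within rounding of the -.39 reported above.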
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Table 11 



Means, Standard Deviations, and Intercorrelations of Variables for Repeaters

Intercorrelations of test variables

                 1      2      3      4      5      6      7      8      9     10     11     12
Variable       LC-D  SWE-D RC&V-D  TOT-D   LC-1  SWE-1 RC&V-1  TOT-1   LC-T  SWE-T RC&V-T  TOT-T

 1 LC-D        1.00    .30    .32    .72   -.47   -.17   -.19   -.32    .46    .08    .05    .23
 2 SWE-D              1.00    .37    .75   -.08   -.47   -.18   -.29    .19    .39    .10    .27
 3 RC&V-D                    1.00    .74   -.13   -.23   -.50   -.35    .16    .08    .27    .20
 4 TOT-D                            1.00   -.30   -.39   -.38   -.43    .37    .25    .18    .32
 5 LC-1                                    1.00    .45    .46    .73    .56    .40    .40    .54
 6 SWE-1                                          1.00    .72    .88    .29    .63    .61    .62
 7 RC&V-1                                                1.00    .88    .28    .60    .70    .64
 8 TOT-1                                                        1.00    .44    .66    .70    .72
 9 LC-T                                                                1.00    .48    .45    .75
10 SWE-T                                                                      1.00    .73    .88
11 RC&V-T                                                                            1.00    .87
12 TOT-T                                                                                    1.00

Mean            3.3    3.2    3.1   32.0   48.4   46.4   47.5  474.6   51.7   49.7   50.6  506.6
S.D.            5.7    5.8    5.4  41.[?]   6.1    6.9    7.3   56.5    6.1    6.6    6.5   53.8

Correlations of characteristics with test variables

13 Interval     .13    .17    .17    .22   -.03    .01    .01   -.00    .09    .17    .15    .16
14 Numtsts      .19    .13    .19    .25   -.19  -.1[?]  -.16   -.20   -.02   -.01   -.02   -.02
15 Edlevel     -.11   -.09   -.08   -.12   -.01    .16    .22    .16   -.11    .09    .18    .07
16 Sex         -.08   -.08   -.04   -.08    .08    .06   -.00    .05    .01   -.00   -.03   -.01
17 Center       .24    .19    .20    .28   -.09   -.33   -.38   -.33    .13   -.18   -.26   -.13
18 Reports     -.04   -.05   -.07   -.07    .09    .13    .13    .14    .05    .09    .09    .09
19 Age         -.10   -.08   -.10   -.13   -.14    .02    .12   -.01   -.24   -.05    .05   -.09

Intercorrelations of characteristics

                13     14     15     16     17     18     19
             Inter-   Num-  Edlev    Sex   Cen-    Re-    Age     Mean    S.D.
               val    tsts                  ter   ports

13 Interval   1.00    .39   -.02    .00    .05   -.03   -.09    11.88    9.21
14 Numtsts           1.00    .01   -.01    .04   -.11    .03     2.87    1.39
15 Edlevel                  1.00   -.01   -.22    .00    .44     0.57    0.50
16 Sex                             1.00   -.06   -.04   -.10     1.30    0.46
17 Center                                 1.00    .03   -.08     0.32    0.47
18 Reports                                       1.00    .04     0.36    0.48
19 Age                                                  1.00    22.99    4.45

Note: Center (1 = took last test in U.S. center, 0 = elsewhere); Reports (1 =
designated schools to receive score reports, 0 = did not do so); Age (in years
at time of t-1).

The circumstances (correlated errors of measurement) leading to elements
of spuriousness in correlations between initial standing and gain based on two
less than perfectly reliable tests (or other measures) are not involved when
other variables are correlated with change (Lord, 1967). Thus, for example,
the correlations of variables such as interval between t-1 and t-T and number
of times tested with change, or with t-1 or t-T TOEFL scores, are not affected
by correlated errors of measurement. Presumably, when observed correlation
between t-1 score and change is introduced to control for initial standing on
TOEFL for purposes of assessing the relative contribution of nontest
variables to change, correlated errors of measurement are not involved (see
following section).*
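
The spuriousness described above can be illustrated with a small simulation. This is a sketch under assumed, hypothetical variances (not the study's data): true gain is generated independently of true initial standing, yet the same t-1 measurement error enters the observed initial score positively and the observed gain negatively, which pushes their correlation below zero. An outside variable (hypothetically named `outside` here) shares no error with either score, so its correlation with observed gain is attenuated but not distorted in this way.

```python
import random
from math import sqrt


def pearson(x, y):
    """Plain product-moment correlation of two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def simulate(n=20000, seed=0):
    """Return correlations for simulated true and observed scores."""
    rng = random.Random(seed)
    true_init, true_gain, obs_init, obs_gain, outside = [], [], [], [], []
    for _ in range(n):
        t = rng.gauss(0.0, sqrt(90.0))   # true initial standing
        g = rng.gauss(0.0, sqrt(20.0))   # true gain, independent of t
        e1 = rng.gauss(0.0, sqrt(10.0))  # measurement error at t-1
        e2 = rng.gauss(0.0, sqrt(10.0))  # measurement error at t-T
        x = t + e1                       # observed initial score
        y = t + g + e2                   # observed final score
        true_init.append(t)
        true_gain.append(g)
        obs_init.append(x)
        obs_gain.append(y - x)           # contains -e1, while x contains +e1
        outside.append(g + rng.gauss(0.0, sqrt(20.0)))  # related to true gain only
    return {
        "true_initial_vs_true_gain": pearson(true_init, true_gain),
        "observed_initial_vs_observed_gain": pearson(obs_init, obs_gain),
        "outside_vs_observed_gain": pearson(outside, obs_gain),
        "outside_vs_observed_initial": pearson(outside, obs_init),
    }


results = simulate()
```

Under these assumed variances the expected observed initial-versus-gain correlation is about -.16 even though the true relationship is zero, while the outside variable's correlation with observed initial standing stays near zero.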

Stability of TOEFL scores. Coefficients reflecting the stability of
relative standing, t-1 to t-T, of examinees in terms of performance on LC,
SWE, RC&V, and Total are shown in the submatrix specified by columns 9-12,
rows 5-8, of Table 11. These coefficients are .56, .63, .70, and .72 for
the respective measures. Listening comprehension was the least stable, and
reading comprehension and vocabulary was the most stable of the
English-language skills measured by the TOEFL.

The stability coefficient (.72) for TOEFL total, over periods averaging
approximately one year, may be compared with the internal consistency
reliability coefficients of approximately .9 reported for this score (e.g.,
Educational Testing Service, 1983), and a coefficient of .92, reported by
Swinton (1983), for scores of 98 students in an intensive language program on
forms of TOEFL administered one week apart. The coefficient is also lower than
comparable test-retest stability coefficients involving verbal admission tests
in samples of U.S. examinees (see, for example, Alderman, 1981; Rock and
Werts, 1979; Wilson, 1983, Table 4). The distinction between "developing
English language skills" in samples of foreign examinees and "developed verbal
reasoning abilities" in samples of U.S. examinees is pointed up by both (a)
the lower stability coefficients for TOEFL in samples of foreign examinees,
than for English language verbal admission tests in samples composed primarily
of native speakers and (b) the differences in patterns of score change for
TOEFL and GRE repeaters (shown earlier in Figure 2).

The time-1 to time-T stability coefficient reflects the t-1/t-T
correlation in a sample composed of 2-, 3-, . . . , 10-time test takers, for
whom the t-1 to t-T interval varied markedly, averaging about 12 months.
Trends in t-1/t-T correlation for 2, 3, . . . , 9-time test takers (the
Numtsts subgroups with N = 50+ in the sample), as well as the score means and
standard deviations for each test administration, are shown in Table 12. The
overall stability coefficients, t-1 to t-T, tend to decrease as Numtsts
increases; that is, stability tends to decline as average amount of change
increases (and as the average t-1 to t-T time interval increases), though with
some reversals in the declining trend. The data in Table 12

* The writer is grateful to Brent Bridgeman for calling attention to these
important points.

Table 12

trends in CQ^^mhiloti ^^etwean Tlme^l and Tima-T Total 
ScoteSi by ^yibir Times Tested , for Repaaters 

Wl^M ^Bg^mmrd to Anslyals Croup 

CorralaClon b^lviin t— seora and soora for daslgnatad 
#d^^ In Is t rat Ion 

Ipi 1,6 1,7 1,8 1,9 (S> 

6,638 
2,577 

<tt3.9) 1,128 
^ 574 £18,2) 608 
644 *586 <20,S) 307 
^- 688 .666 ,594 C20-7) 175 
^ 687 .691 ,632 .645 C22.9) 104 
757 ,70J .73i ,684 .152 C24,l) 51 

Mesa laterval la Bostlis b^tma d^^slgnated adBlQlsCratloBS , t--l to t-T- 

Hmmum l§r desC^DAtad adBlnlstratloas 



Tlaes 


1,1 


1.3 


1,3 


1»4 


tasted 










(2) 


1.000 


. 784 






(3) 


1.000 


. 800 


*674 1 




(4) 


l.OOP 


» 788 


*71S 


m 


(5) 


1.000 


. 76S 


.73i 


>m 


(6) 


1.000 


. 710 


• 76S 




(7) 


1.000 


.803 


.750 




(8) 


1.000 


• 762 


.714 


.700 


<9) 


1.000 


. 748 


.767 


m 





1 




2 


3 


4 5 6 7 




8 


9 


(2) 


483.8 


507. 


5 












€3) 


468.8 


488. 


4 


50i#6 










(4) 


459*9 


477. 


4 


490iS 










(5) 


452. S 


467. 


6 


479*4 


Ana 50 3.6 








€6) 


452.1 


468. 


0 




Alhl 49 .e. 2 506.2 








€7) 


446.3 


462* 


2 


472.6 


419.4 48 ^.2 492*7 500.9 








€8) 


445.7 


462. 


2 


472*6 


MU 48 ^,5 492*3 495.7 


509* 


5 




€9) 


440. S 


457* 


6 


463*^ 


4?l,g 47^.8 483.9 488.8 


495* 


2 


501.9 












$immtd davlatloas 










1 




1 


3 


4 5 6 7 




8 


9 



(2) 58*8 57.8 

(3) 52.3 49.2 49. § 
C4) 49.2 45-0 45.8 

(i) 46*8 44.0 44.3 40.9 

(6) 47.3 41.0 42.0 M 4C>.9 42-1 

(7) 45*8 43.1 44.6 Uli 4^.9 45.3 47.7 

(8) 44,1 42.7 3i*2 M %& ,1 40.7 41*5 49.9 

(9) 46.0 44*5 48.7 4s , 4 40.2 43.9 48.1 49,7 

Notai Ha for naans and st^pditd d^^latloas ara as shown for corral- 

atlons. These analjfsas a^e b^iid an 20 percent sampla of repeaters. 
attest further to the developing nature of the skills measured by the TOEFL in
samples of examinees.



Relative Contribution of Nontest Variables to Prediction of Change

In order to assess the relative contribution of the nontest variables
to the prediction of change in TOEFL total within each of the 13 analysis
groups and in the total sample, two multiple regression analyses were
conducted. In the first analysis, change in TOEFL total was regressed on
the seven nontest variables only; in the second analysis, the t-1 TOEFL
total score was added to the set of nontest variables. Results of the two
analyses are provided in Table 13. The first row for each group shows standard
partial regression coefficients for the seven nontest variables only, and the
corresponding multiple correlation coefficients; the second row shows
corresponding data for the analysis in which the t-1 TOEFL total score was
added to the battery. The four nontest variables with the highest partial
regression (beta) weights are indicated by bold-face type.

The general-sample results in Table 13 indicate that the same set of four
variables (Ntests, Intrvl, Center-U.S., and Age) contributed most to
prediction of change in TOEFL total in both the analysis without control for
t-1 score and that with control for t-1 score. However, introduction of control
for t-1 score resulted in changes in the relative magnitudes of the regression
weights for these variables. Without control for t-1 score, Ntests and
Center-U.S. are more heavily weighted than Intrvl and Age; with control for
t-1 score, Center-U.S. has a slightly lower weight than Intrvl, and Ntests and
Age contribute almost equally to prediction, with slightly lower weights than
those for Intrvl and Center-U.S. Sex (M = 1, F = 2) was negatively
weighted in the total sample analysis, as was t-1 score reporting (Reports).

With some variation in detail, the pattern of findings in the respective
analysis groups was similar to the pattern described above: with control for
the t-1 score, Intrvl was more heavily weighted than either Ntests or
Center-U.S. in 10 of 13 groups, but without control for t-1 score, either
Ntests or Center-U.S. was more heavily weighted than Intrvl in 11 groups. Sex
was negatively weighted in ten of the analysis groups, with control over t-1
score, and the findings for Reports were similar.



Relative Contribution of Variables to Prediction of t-T Total Score

Table 14 shows the results of regression analyses in which the dependent
variable was the last TOEFL total score of record (the t-T score), and the
independent variables were the t-1 score and the seven nontest variables.
Simple t-1/t-T correlations and multiple correlation coefficients are shown,
with standard partial regression weights for the various independent
variables. As expected from the preceding analysis (in which change, t-1 to t-T,
was the dependent variable), (a) inclusion of the nontest variables resulted
in enhanced prediction of the t-T total score, and (b) Age, Intrvl, Center,
Table 13

Results of the Regression of Change (t-T minus t-1) in TOEFL Total on Selected
Personal, Background, and Test-Related Variables, With and Without Control
for Time-1 Total Score, by Analysis Group

                        Standard partial regression weights
               (1)      (2)     (3)     (4)     (5)    (6)   (7)    (8)     Multiple
Group    N   Total-1   Ntests  Intrvl  Center  Edlevl  Age   Sex  Reports  correlation R


01 


3070 




.200* 
.Oil* 




.081* 
,070* 


-.008 
<060 


-,078 
-.124* 


-,Qs|i 
-.Of it 


•.110* 
-.037 


.278 
.410 


02 


1781 




.301* 
.Ul* 


,lS6* 


,100* 
,098* 


-.013 
-.037 


-.015* 
-.103* 




-.048 
-,026 


,436 
,524 


03 


S19 


-,]1! 


.176* 
,116* 


,iS2* 


,124* 
,094* 


.004 
.084 


-.074* 
-.110* 


-op! 


-.023 
-.012 


.338 
.450 


0^ 


733 




.116* 
,105* 




.213* 
,17e« 


-,016 
' .015 


-,081 
-.117* 


-.On 

-.O^i 


-,098* 
-,047 


.430 
.517 


OS 






.262* 
,207* 


.269* 


.213* 
,133* 


-.026 
.00$ 


-.104* 
-,012* 


,qei 


-,051 
-.030 


.532 
.585 


Oi 


313 




,121* 
Mi 


,0^7 
,1^7* 


.142* 

.086 


.069 
• 111* 


-,061* 
-.130* 


.0}] 


,068* 
.100* 


.205 
,446 


07 


lU 


^,319 


,179* 

,110* 


JQb4* 


.126* 
.078* 


.001 
.026 


-.056* 
-.108* 


•m 


-,010 
,035 


,356 
,462 


oa 


1500 


-.420 


.20S* 
.137* 


.^l 4* 
.2£:4* 


.010 
-,042* 


.002 
,009 


-.026* 
-,107* 


-.0Q4 


-,079* 
-,024 


.364 
,540 


A A 

09 


S§i 




,251* 
.114* 


,n 8* 


,215* 
,133* 


,054 
,133* 


-,159* 
-,212* 


-Ml 
•Mi 


-.013 
.023 


.434 
.601 


10 


48 




,165* 
.166* 


g* 
.az 1* 


,219* 
,257* 


.010 
-,Q21 


-.182* 
-.117 


-,m 

-.130 


-il26 
-,1S5* 


,382 
,430 


11 


101 


-.392 


-.Sii* 
-,0i8* 


,16 1* 


,475* 
,39i* 


.012 
,098* 


,103* 
.016 


-M6 
'Mi 


-,032 
-.026 


.503 
.622 


12 


307 




,339* 
,215* 


,26 J» 
JM* 


,218* 
,124* 


-,014 
.040 


.069 

.006 




-.088 
.024 


,572 
.649 


13 


214 




. 205* 
,107* 


^93^* 


.048 
,047* 


,033 
-,030 


-.106* 
-,102* 


■'MP 
Mo 


-,043 
.027 


.246 
.445 


total 


11612 


n3S? 


,llO* 
.111* 


.1U» 
.153- 


.253* 
,142* 


-,026 
.016 


-,098* 
-,119* 


-.076 
-MS 


-.056 
■iOOi 


,412 
,525 



Note. Groups are: 01 (Taiwan), 02 (Hong Kong), 03 (Korea), 04 (Thailand),
05 (Japan), 07 (Saudi Arabia, Kuwait, Jordan, Lebanon, Iraq, Syria),
08 (Iran), 09 (Chile, Colombia, Mexico, Peru, Venezuela), 12 (Greece-Turkey),
06 (India), 10 (Germany, Netherlands, Denmark, Norway, Sweden), 11 (France,
Italy, Spain, Portugal), 13 (Ghana-Nigeria). Highest regression weights for
nontest variables.
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.131 
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il37 
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iQej 


"Oil* 
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.130 


.113 


-iOlO 




OP m 


.702 
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.234 


.105 




n 307 


.147 


.m 


.213 


ilPO 




01 313 


Ml 


,030* 


iOJ? 


iOii 


Mi 


10 


,m 
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.130 


tl3Q 




11 101 


.753 


-.079 


.m 


i31? 


m 


13 m 


J31 


.682 


-i028* 


ml 




m iim 


J75 


.088 


.117 




i013* 



Wisslan viiihts 



(Korea), 04 
Oslsibli, Un, 
iUly, ipilD, 



Aaaiyiii m based qh a 10 pafeint fiadsi galpli 
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7 



lal), 13 (Ghipa41§irla), 
* Viriibli ediilflbytis Im thas ,001 to Nqmrid In E (£i,iJ6?8), 



Agl 


lit 


RlDOfES 


ET.fl-l 


i) ftl.(f) 


=*oa8 


-Ml 




,m 


M 


-.077 


^.031 


»iOiO* 


.169 


?in 


-i072 


-.0§Q 


-iOQP 


.814 


.780 

i ! 




-.02? 


^034 


.777 


i723 

i I fej 


-.041 


.013* 


-.020* 


.844 


.797 


=.08? 


-,01|# 


s02§* 


,701 


.657 


Inn 

-.102 


=,00|* 


*,022* 


Ml 


.51J 




-.012* 


01S4 


778 




.004* 


■i044 


=.019* 


.7P0 




-Ml 


i017* 


,050 


.S$3 


.884 


-.068 


=.07S 


-,Q?1 


.84$ 


.61« 


i0l3* 


^021* 


=.020* 


.77S 


.700 


-,07f 


.000* 


iOai* 


,723 


,714 


=i0P2 


=.050 


=.006* 


,755 


.np 


repiaterSi 


Groups 


ire 01 (Tilwiij), 


02 (Hi 




, Ireqj Jq 


fdiB, IP' 


li), 08 (Ifin), 


OP (01 


lile, MW^ 




'land a, 


aigrki Hoivay) E 


vedenli 





and Ntests were the major nontest contributors to prediction in the total
sample and in most of the analysis groups.

Number of times tested may be an indirect measure of effort, motivation,
financial resources, practice effects, and so on. A longer interval between
testings means greater potential opportunity for study, practice, and general
maturation of proficiency in English. Individuals with longer t-1 to t-T
intervals are unlikely to be engaged in intensive English language study
during the entire interval between the initial and the final testing.
However, it is reasonable to assume that between the first and the last test
administration most I&SC repeaters are motivated, in part, by a common general
objective, namely, to improve their English proficiency as an element in their
overall preparation for study in the United States. Precisely how they
attain this objective cannot be determined from the data; however, the
findings indicate that both number of times tested and time interval
between initial and final test administrations contribute independently to
change in TOEFL scores.

The positive contribution of being in the United States at time of final
testing undoubtedly reflects, to some extent, the impact of the general
language environment on language learning.
Section VIII. Observed vs. Expected Final (t-T) TOEFL Total Score Means,

by Analysis Group

Results of the total-sample regression analyses provide a basis for
formulating expected mean t-T scores for the respective analysis groups and
determining the extent to which their observed t-T total score means are
consistent with expectation. Using the total sample regression equation, which
takes into account the t-1 score, Ntests, Intrvl, Center, Edlevl, Age, Sex,
and Reports, expected t-T TOEFL total scores (t-T exp) were computed for each
of the 13 analysis groups. The equation was as follows:

Expected t-T = .737 (t-1) + 3.416 (Ntests) + .681 (Intrvl) + 12.637 (Center)
               + 1.369 (Edlevl) - 1.107 (Age) - 5.877 (Sex) - .647 (Reports)
               + 167.391.

For a composite based on this equation, R = .755; the standard error of
estimate (for individual predicted t-T scores) is approximately 35 scaled score
points.
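
The regression equation above can be evaluated directly. The following is a minimal sketch in Python, not part of the original analysis: the coefficients and intercept are those printed in the equation, while the function name, the keyword names, and the numeric coding of each predictor in the example (for instance, Center = 1 for a final test taken in the United States and 0 otherwise) are assumptions made for illustration only.

```python
# Coefficients as printed in the all-repeater regression equation.
# The variable names follow the report's labels; how each variable was
# coded numerically is NOT fully documented here and is assumed below.
COEFFICIENTS = {
    "t1": 0.737,       # time-1 TOEFL total score
    "ntests": 3.416,   # number of times tested
    "intrvl": 0.681,   # interval between t-1 and t-T
    "center": 12.637,  # tested in the U.S. at t-T (assumed 0/1 coding)
    "edlevl": 1.369,   # educational level
    "age": -1.107,     # age
    "sex": -5.877,     # sex (the report gives M = 1, F = 2)
    "reports": -0.647, # t-1 score reporting
}
INTERCEPT = 167.391


def expected_tT(**predictors):
    """Return the expected t-T total score for one set of predictor values."""
    missing = set(COEFFICIENTS) - set(predictors)
    if missing:
        raise ValueError(f"missing predictors: {sorted(missing)}")
    return INTERCEPT + sum(
        weight * predictors[name] for name, weight in COEFFICIENTS.items()
    )


if __name__ == "__main__":
    # Hypothetical predictor values, chosen only to exercise the function.
    outside = expected_tT(t1=500, ntests=2, intrvl=12, center=0,
                          edlevl=3, age=4, sex=1, reports=2)
    inside = expected_tT(t1=500, ntests=2, intrvl=12, center=1,
                         edlevl=3, age=4, sex=1, reports=2)
    print(round(inside - outside, 3))  # the Center weight
```

Under the assumed 0/1 coding, two otherwise identical examinees differ by the Center weight alone, which is how the "over 12 points" difference associated with final testing in the United States arises from the equation.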

Table 15 shows the actual t-1 and t-T means, the expected means, and the
difference between the two (actual minus expected), by analysis group. Groups
are ordered in terms of the algebraic difference (high positive to high
negative) between actual and expected t-T means.
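
The ordering used in Table 15 can be sketched as follows. This is an illustration only: it uses four rows of Table 15 whose actual and expected t-T means are legible in the source (groups 03, 06, 05, and 02), takes the difference as actual minus expected (the convention stated in the table note), and the function and variable names are hypothetical.

```python
# (group, actual t-T mean, expected t-T mean) for four rows of Table 15
# whose entries are legible in the source; other rows are omitted.
TABLE_15_ROWS = [
    ("05", 503.1, 504.2),
    ("03", 521.1, 518.8),
    ("02", 519.0, 520.1),
    ("06", 526.0, 526.0),
]


def ordered_residuals(rows):
    """Return (group, actual - expected) pairs, high positive to high negative."""
    residuals = [(group, round(actual - expected, 1))
                 for group, actual, expected in rows]
    return sorted(residuals, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for group, diff in ordered_residuals(TABLE_15_ROWS):
        print(f"{group}  {diff:+.1f}")
```

A positive residual means the group's observed final mean exceeded the mean expected from the all-repeater equation; a negative residual means it fell short.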

o Observed t-T means were 10 to 24 points higher than expected, based on
the general sample regression equation, for groups 10 (Germany, Netherlands,
Denmark, Norway, Sweden), 11 (France, Italy, Spain, Portugal), 12 (Greece,
Turkey), and 09 (Chile, Colombia, Mexico, Peru, and Venezuela). Groups 10 and
11 had the two highest t-1 means, and were characterized by low incidence of
repetition; groups 12 and 09 had comparatively low t-1 means, but were
characterized by only moderate incidence of repetition. The Iranian and Korean
repeaters had t-T means between 2 and 6 points higher than expected; the t-1
means for these two groups differed by about 60 points.

o Sample t-T means for Thailand and Taiwan were between 4 and 7 points
lower than expected; smaller negative residuals are evident for repeaters from
Hong Kong, Japan, Ghana-Nigeria, and the Arabic-speaking Mideastern
countries (groups 02, 05, 13, and 07). For Indian repeaters (group 06), the
expected mean and the actual mean were the same.

In evaluating these findings it should be kept in mind that the analysis
was not controlled for "intervening experience" variables that might be
expected to affect growth in English proficiency (e.g., amount, intensity, and
quality of total English-language involvement between initial and final
testing), with respect to which there may have been (may tend to be) average
differences among examinees from countries making up the analysis groups
under consideration.

The fact that European and South American repeaters had higher gains
relative to expectancy than those from, say, the Mideast, Taiwan, or Japan may
reflect differences in the amount, intensity, and/or quality of total
Table 15

Discrepancies Between Observed and Expected Time-T TOEFL Total Scores, by
Analysis Group: Expected Score Based on the All-Repeater Regression
Equation Employing Time-1 Score and Seven Nontest Variables*

                   (1)         (2)         (3)
                  Time 1      Time T      Time T     Difference
Group      N      actual      actual     expected    (2) - (3)

11        101     530.7       565.9        [?]        +24.0
09        566     465.8       513.9        [?]        +12.4
12        307     475.0       518.0        [?]        +10.4
10         48     531.7       555.7        [?]        +11.6
08        [?]      [?]        483.0        [?]        + 5.6
03        819     495.5       521.1       518.8       + 2.3
06        313     501.2       526.0       526.0         0.0
13        214     490.[?]     51[?].9     515.7       - 0.8
05       1346     470.3       503.1       504.2       - 1.1
02       1781     439.5       519.0       520.1       - 1.1
07        814     439.7       485.4       487.3       - 1.9
01       3070     490.7       50[?].8     512.7       - 3.9
04        733     454.4       485.1        [?]        - 7.4

Total   11612     474.6       505.[?]      [?]          0.0

Note. Analyses are based on a 20 percent sample of all repeaters.
Groups, ordered in terms of actual minus expected means, are 11
(France, Italy, Portugal, Spain), 09 (Chile, Colombia, Mexico, Peru,
Venezuela), 12 (Greece-Turkey), 10 (Germany, Netherlands, Denmark,
Norway, Sweden), 08 (Iran), 03 (Korea), 06 (India), 13 (Ghana-Nigeria),
05 (Japan), 02 (Hong Kong), 07 (Saudi Arabia, Kuwait, Lebanon, Iraq,
Jordan, Syria), 01 (Taiwan), 04 (Thailand).

* Expected t-T = .737 (T-1) + 3.416 (Ntests) + .681 (Intrvl)
                 + 12.637 (Center) + 1.369 (Edlevl) - 1.107 (Age)
                 - 5.877 (Sex) - .647 (Reports) + 167.391.






English-language intervening experience that may be characteristic of
contingents of examinees from these regions or countries who either elect, or
are required, to repeat TOEFL. Such a pattern of findings may also reflect, to
some extent, differences in the degree of "linguistic distance" between
English and the various languages involved, and associated differences in
characteristic rates of acquisition of proficiency in English as a second
language.

The potential value of living in an English-speaking environment as a
factor influencing score gain is suggested by the finding that repeaters whose
last test was taken in the United States gained over 12 points more, on the
average, than those tested outside the United States with similar scores on
other study variables.



Section IX. Tendency to Repeat as a Function of Initial Score

It was assumed at the outset that the tendency to repeat TOEFL is related
inversely to initial score level. The findings that have been reviewed are
generally consistent with that assumption. For example, among first-time test
takers in all analysis groups the mean initial scores of those identified as
repeaters were systematically lower than those who were identified as
nonrepeaters (see Figure 4); the four analysis groups with lowest percentages
of repeaters were among the six with t-1 means above the median for all
analysis groups, while only two of six groups with above-median percentages of
repeaters were above the t-1 median (Table 4 and related discussion); and,
within each analysis group, for subgroups classified by number of test
repetitions, t-1 means tended to vary inversely with number of times tested
(see Figure 2 and related discussion).

The analyses involved, however, did not examine the nature of the
relationship between tendency to repeat and initial score levels within the
various analysis groups. Table 16 shows the percentage of repeaters among
first-time test takers classified by TOEFL total score intervals within
various analysis groups. For this analysis, certain of the 13 original analysis
groups were consolidated: I = Hong Kong and Korea, originally 02 and 03; II =
Taiwan, 01; III = Thailand and Japan, 04 and 05; IV = Saudi Arabia, Kuwait,
Jordan, Lebanon, Iraq, Syria, and Iran, 07 and 08; V = Chile, Colombia,
Mexico, Peru, Venezuela, and Greece-Turkey, 09 and 12; and VI = Germany,
Netherlands, Denmark, Norway, Sweden (10), France, Italy, Spain, Portugal
(11), Ghana-Nigeria (13), and India (06), combined.

Trends in the data are portrayed graphically in Figure 5, which plots the
percentage of repeaters at successive score levels for each of the six
analysis groups. The mean t-1 score for the repeaters is indicated.

o It is quite clear that the tendency to repeat was not a simple linear
inverse function of initial score level; that is, the tendency to repeat did
not increase (decrease) regularly as initial score level declined (increased)
throughout the score range in any of the analysis groups. Rather, the tendency
to repeat appeared to be inversely related to score level only in the upper
score ranges. For example, in all groups between score-level 490 and
score-level 590 (representing intervals 480-499 through 580-599) an inverse
relationship tended to obtain. Across a larger intervening score range (roughly
490 down to 390), the tendency to repeat was relatively stable. Percent
repeating tended to vary directly with score level in the lower score ranges
(below 390); that is, percent repeating tended to decrease as score level
decreased across score intervals below 390.

o It is also evident that the tendency to repeat was strongly associated
with analysis group membership. For example, except for examinees in the two
lowest score intervals reported, Asian examinees (groups I, II, and III) were
much more inclined to repeat than examinees in other groups; those from
Thailand and Japan (group III) exhibited relatively strong tendencies to
repeat at score levels above 540 and below 340. On the other hand,
proportionately few
Table 16

Percentage of First-Time Test Takers Repeating TOEFL, by
Initial Score Level and Analysis Group

TOEFL total                  Consolidated analysis group*
at initial
testing          I       II      III      IV       V       VI     Total
                 %        %       %        %       %        %       %

580 +          19.9     15.0    30.3      6.0     5.9      3.5     9.6
560-579        25.6     21.5    34.9      6.0     7.7      4.5    13.6
540-559        35.0     33.0    36.6      7.7     8.4      5.9    19.8
520-539        46.1     45.4    43.7     11.6    14.4      7.3    27.9
500-519        48.3     48.4    46.8     16.0    16.2      8.4    31.1
480-499        51.1     57.4    56.4     25.5    32.5      [?]    39.5
460-479        53.2     55.3    52.3     28.7    38.1     14.4    40.4
440-459        48.2     51.5    51.3     30.3    32.7     14.0    38.0
420-439        44.4     44.7    51.1     28.5    34.7     12.8    35.6
400-419        48.3     45.0    47.5     29.2    33.3     18.2    35.7
380-399        46.5     42.0    45.7     28.3    34.9      5.9    33.0
360-379        40.6     35.6    40.0     26.4    32.0      7.1    30.1
340-359        15.2      4.5    43.4     13.0    26.0      9.5    25.4
< 340           6.8      **     25.0     19.1    21.0     33.3    20.9

Total          49.7     44.5    47.1     23.5    22.1      8.6    30.0

Note. Underscored entries indicate percentages of repeaters
in the score intervals that include the mean t-1 score for
repeaters in the designated groups.

*I (Hong Kong and Korea, original 02 and 03), II (Taiwan,
01), III (Thailand and Japan, 04 and 05), IV (Middle East, 07
and 08), V (South America and Greece and Turkey, 09 and 12),
VI (India, Nigeria and Ghana, Europe-Germanic, Europe-Romance,
06, 13, 10, and 11).

**Less than five cases.
Figure 5. Percentage of repeaters by initial score
level on TOEFL for consolidated analysis groups

(20 percent sample)

[Figure not reproduced. Horizontal axis: TOEFL total score divided by 10
(midpoint of t-1 TOEFL total score interval), 33 through 59.]

Entries in ( ) are original analysis groups (see Table 15).



group VI (European, Indian, African) examinees, regardless of their initial
score levels, repeated TOEFL. Moreover, most of them took one additional
test, while the Asian contingents included many multiple repeaters.
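
The nonlinear pattern described in this section can be checked against the Total column of Table 16. The sketch below is illustrative only: the percentages are those printed in the Total column, while the function names are hypothetical and the check is a simple descriptive one, not the analysis used in the report.

```python
# Total column of Table 16: (initial score interval, percent repeating),
# listed from the highest score interval to the lowest.
TOTAL_PERCENT_REPEATING = [
    ("580+", 9.6), ("560-579", 13.6), ("540-559", 19.8), ("520-539", 27.9),
    ("500-519", 31.1), ("480-499", 39.5), ("460-479", 40.4),
    ("440-459", 38.0), ("420-439", 35.6), ("400-419", 35.7),
    ("380-399", 33.0), ("360-379", 30.1), ("340-359", 25.4), ("<340", 20.9),
]


def peak_interval(rows):
    """Return the (interval, percent) pair with the highest percent repeating."""
    return max(rows, key=lambda row: row[1])


def is_strictly_inverse(rows):
    """True only if percent repeating never falls as score level falls."""
    percents = [pct for _, pct in rows]
    return all(later >= earlier
               for earlier, later in zip(percents, percents[1:]))


if __name__ == "__main__":
    print(peak_interval(TOTAL_PERCENT_REPEATING))
    print(is_strictly_inverse(TOTAL_PERCENT_REPEATING))
```

For the total sample, percent repeating rises as score level falls only down to the 460-479 interval and then declines, so a single inverse relationship across the whole score range does not hold.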



Some Unresolved Questions

What accounts for the differences among analysis groups in incidence of
repeated test taking by initial score levels on TOEFL? Why should only about
10 percent of group VI examinees with initial scores slightly above 500 repeat
TOEFL as compared to about 50 percent of Asian examinees at that initial score
level? Do these differences in patterns of test-taking behavior reflect
differences in "perceived proficiency" for examinees at the same score level
who have different linguistic-cultural backgrounds? Do U.S.-bound Asian
students, as compared to, say, their European counterparts, perceive level of
score on TOEFL to be more critical to the realization of their plans to study
in the United States?

To what extent are differences in level of performance on TOEFL
associated with differences in the realization of reported plans to study in
the United States or Canada? Are there differences in realization of plans for
nonrepeating and repeating examinees? Research is needed to answer questions
such as these, questions that relate to the dynamics of test repetition in
different subgroups within the population of International and Special Center
candidates. Research is also needed to provide empirical evidence regarding
the relationship between score levels on TOEFL and measures of the functional
communicative competence, general and academic, of ESL students (faculty
ratings and/or self-ratings, for example).

What accounts for the apparent decrease in tendency to repeat among the
lowest scoring first-time International and Special Center test takers? One
plausible hypothesis is that many of these very low-scoring individuals who
apparently did not repeat the TOEFL in an I&SC administration (a) may have
enrolled in special ESL programs in order to improve their English
proficiency, and (b) may have met institutional requirements through
successful completion of those programs rather than by formal I&SC testing.

It is also possible that many of the low-proficiency I&SC nonrepeaters
may have repeated the TOEFL again (unofficially) in test administrations
supervised by individual institutions as part of the TOEFL Institutional
Testing Program (ETS, 1983). There undoubtedly is some, perhaps considerable,
overlap between the Institutional (INST) and I&SC examinee populations. At
present there is no basis for estimating the extent of movement of examinees
between the two testing program populations. However, overlapping (or movement
between populations) may be due to a variety of logical patterns of
test-taking behavior.

Movement between the two populations is probably most characteristic of
first-time TOEFL takers (in I&SC or institutional test administrations) with
low scores. For example, low-scoring first-time I&SC examinees may enroll
directly, without further I&SC testing, for ESL instruction and take TOEFL a
second time (or more) as part of an institutional test administration, with no
further need to take TOEFL in an official I&SC test administration. Such a
model would help to account for the finding that substantial percentages of
very low-scoring first-time I&SC test takers (for example, below 400 on TOEFL
total) did not repeat as I&SC examinees. On the other hand, foreign nationals
with recognizedly limited English proficiency may enroll in special ESL
courses, and take TOEFL for the first time in an institutional test
administration. They may later take TOEFL again, but in an I&SC administration
to support an application for admission.*

Research is needed to establish both the degree of overlap and the
typical flow patterns between the INST and the I&SC examinee populations. In
the meantime, preliminary evidence regarding patterns of short-term test-taking
behavior and score change for examinees tested for the first time as part of
an institutional testing program is provided in the final section of this
report.



* The TOEFL background question regarding previous experience with TOEFL is
open-ended with respect to the auspices of previous test administrations.
Institutionally Tested Repeaters

The preceding sections provided information regarding test-taking
patterns and TOEFL score change for examinees in 13 analysis groups (based
primarily on country of origin and native language) (a) who were tested for the
first time in an International and Special Center (I&SC) test administration
between July 1977 and June 1980, and (b) who had taken TOEFL again at least
one time in an I&SC test administration as of June 1982.

About 28 percent of first-time I&SC examinees accumulated at least one
additional test record within 24 to 59 months after initial testing (mean
number of tests = 2.9). The average interval between the initial and the last
test administration was approximately one year (mean = 11.9 months). Taking
into account the initial (time-1 or t-1) total score, the variables
contributing most to prediction of final total score (time-T or t-T) were
(a) being in the U.S. at the time of testing, (b) time interval between t-1
and t-T, and (c) number of times tested. Age, sex, and educational level at
time of initial testing also contributed to prediction after control for
initial score.

The average amount of change in TOEFL total score, over t-1 to t-T
intervals averaging about one year, was 32 scaled score points, but there were
substantial differences among the analysis groups with respect to mean change
(from 18 points to 54 points). Some groups had substantially higher t-T means
than expected based on their t-1 scores and other variables found to be
associated with change (time interval, number of times tested, location, and
so on); for other analysis groups, the opposite was true.

Because of the length of the follow-up period, the observed average score
change for I&SC repeaters between the initial and last observed test
administration may be thought of as reflecting "long-term" change associated
with (a) tenure as members of the I&SC examinee population for the majority of
repeaters and (b) the entire range of conditions that normally prevail between
testings for individuals who repeat TOEFL in I&SC administrations.

This section presents information regarding "short-term" TOEFL
total-score change for examinees who are assumed to have been enrolled in
programs of instruction in English as a second language between testings.
Based on data in TOEFL files, the examinees involved (a) were tested for the
first time by an institution as part of the TOEFL Institutional Testing
Program between July 1984 and July 1985, (b) indicated that they were planning
undergraduate-level or graduate-level study in the U.S., (c) were in one of
the analysis groups defined for the study, and (d) were identified as
repeaters within the study period; that is, they had accumulated at least one
additional test record as of August 1985, in an INST test administration,
within one to 12 months following initial testing.

Information is not available regarding the actual status of the
institutionally tested repeaters, their experience between test
administrations, conditions governing opportunities for examinees to take
TOEFL, and so on. However, it is assumed that most repeaters ordinarily are
enrolled for intensive instruction in English as a second language and that
their test records are obtained as part of institutional testing programs
designed (a) to assess need for special instruction in English, (b) to
evaluate progress following some period of ESL instruction, and/or (c) to
certify the "readiness" of the individuals to undertake academic instruction
in English.

If these assumptions are valid, average changes in TOEFL performance for
institutionally tested repeaters may be thought of as reflecting primarily
average gains, over specified periods of time, associated with participation
in the typical range of programs of formal instruction in English as a second
language that are offered by the institutions involved. In any event, it is
reasonable to assume that "intervening conditions" are more comparable for
individuals and for analysis groups in samples of INST "short-term" repeaters
than in samples of "long-term" I&SC repeaters.



Description of the Institutional Data

A total of 10,055 individuals from the countries included in the 13 basic
analysis groups defined for the study were identified as first-time TOEFL
takers in an institutional test administration during the period July 1984
through August 1985. First-time INST examinees, generally, and in the great
majority of analysis groups, had lower TOEFL means than did their I&SC
counterparts (Table 17). This is consistent with the assumption that need for
special instruction in English as a second language is associated with
membership in the population of institutionally tested examinees, whereas the
I&SC examinee population includes individuals with high as well as low levels
of proficiency in English. The few exceptions to this pattern in Table 17 may
be due primarily to sampling considerations.

Some 10.4 percent of the INST examinees were identified as repeaters;
that is, were tested again, one or more times, within one to 12 months
following the initial testing (see Table 18).*
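
The 10.4 percent figure follows from the counts reported in Table 18 (10,055 first-time INST examinees, of whom 9,007 were tested only once). The short sketch below reproduces that arithmetic; the function name is hypothetical, and the second example uses the Table 18 counts for group 05 (Japan).

```python
def percent_repeating(n_first_time, n_tested_once):
    """Percentage of first-time examinees tested more than once."""
    if n_first_time <= 0 or not 0 <= n_tested_once <= n_first_time:
        raise ValueError("counts must satisfy 0 <= tested once <= total, total > 0")
    return 100.0 * (n_first_time - n_tested_once) / n_first_time


if __name__ == "__main__":
    # All INST first-time examinees (Table 18, Total row).
    print(round(percent_repeating(10055, 9007), 1))
    # Group 05 (Japan): 1,891 first-time examinees, 1,591 tested once.
    print(round(percent_repeating(1891, 1591), 1))
```

The same calculation applied row by row gives the complement of the "tested once" entries in the percentage distribution of Table 18.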

The overall incidence of repeated test taking was lower in these INST
samples than in the I&SC samples (about 28 percent) and the incidence of
multiple test taking (three or more tests) was lower as well (compare data in
Table 18 with data in Tables 2 and 3). However, such differences are
consistent with expectation.

o The I&SC data reflect test taking in regularly scheduled monthly test
administrations; individuals tested for the first time during a three-year
period had from 24 to 59 months in which to repeat the TOEFL in a regular-



*Three analysis groups were represented by fewer than 10 repeaters: Group 06
(India), Group 10 (Germany, France, Netherlands, Denmark, Norway, Sweden), and
Group 13 (Ghana, Nigeria). Data for repeaters in these very small groups are
not reported separately, but were included in analyses involving repeaters
without regard to analysis group.
Table 17

Means of TOEFL Examinees Tested for the First Time by an Institution
as Compared to Means for First-Time Examinees Tested in an
International and Special Center Administration

                             Listening   Structure &    Reading Comp
Group   Program      N       Comp        Written Exp    & Vocabulary    Total

01      I&SC       36877     50.3        49.2           50.8            505.4
        INST         403     49.5        47.7           48.2            484.8

02      I&SC       23203     50.5        49.5           51.1            504.1
        INST         175     51.8        48.5           49.7            500.3

03      I&SC       10171     49.2        49.6           52.0            50[?].5
        INST         898     45.3        45.3           46.0            455.3

04      I&SC        9463     47.9        45.3           45.9            463.3
        INST         450     49.5        47.4           47.4            481.0

05      I&SC       14215     48.9        47.1           47.5            476.4
        INST        1891     45.4        45.3           44.3            450.1

06      I&SC       19157     54.0        55.8           57.4            557.2
        INST          96     53.2        51.2           51.8            521.8

07      I&SC       20830     49.4        44.4           43.9            459.0
        INST        3165     47.1        42.5           41.2            436.8

08      I&SC       31594     49.1        43.8           42.8            452.5
        INST         705     48.5        44.8           43.6            456.3

09      I&SC       15018     53.0        47.7           51.7            507.5
        INST        1478     49.4        45.7           48.4            478.4

10      I&SC        6449     60.9        56.4           57.1            581.5
        INST         139     5[?].8      54.6           54.2            558.9

11      I&SC        6131     5[?].1      54.3           [?]             558.3
        INST         343     53.0        51.6           [?]3.0          525.[?]

12      I&SC         746     53.4        49.3           49.4            506.7
        INST         236     49.7        46.5           45.4            472.1

13      I&SC       20886     49.1        53.4           53.4            519.6
        INST          76     52.6        5[?].[?]       50.5            517.8

Note. Groups are: 01 (Taiwan), 02 (Hong Kong), 03 (Korea), 04 (Thailand), 05
(Japan), 06 (India), 07 (Saudi Arabia, Kuwait, Libya, Lebanon, Syria, Jordan,
Iraq), 08 (Iran), 09 (Mexico, Chile, Colombia, Peru, Venezuela), 10 (Germany,
Netherlands, Norway, Sweden, Denmark), 11 (France, Italy, Spain, Portugal), 12
(Greece, Turkey), 13 (Ghana, Nigeria). The INST samples are from TOEFL
Institutional Testing Program files for July 1984 through August 1985; the I&SC
samples are those from International and Special Center files who were tested
for the first time between July 1977 and June 1980.
Table 18

First-Time Institutional Test Takers, July 1984-August 1985, by Analysis
Group and Number of Times Tested During the Study Period

                          Number of times tested

Group*        N        1        2        3        4        5        6

01          403      369       26        4        4
02          175      162        9        2        2
03          898      762      117       16        3
04          450      401       38        9        2
05         1891     1591      210       64       17        4        5
06           96       89        6        1
07         3165     2901      180       50       23        9        1
08          705      631       54       17        3
09         1478     1358       88       30        1        -        1
10          139      134        5
11          343      321       19        3
12          236      212       19        5
13           76       76

Total     10055     9007      771      201       55       13        7

                          Percentage distribution**

01                  91.6      6.5      1.0      1.0
02                  92.6      5.1      1.1      1.1
03                  84.9     13.0      1.8      0.3
04                  89.1      8.4      2.0      0.4
05                  84.1     11.1      3.4      0.9      0.2      0.3
06                  92.7      6.2      1.0
07                  91.7      5.7      1.6      0.7      0.3        *
08                  89.5      7.7      2.4      0.4
09                  91.9      6.0      2.0        *
10                  96.4      3.6
11                  93.6      5.5      0.9
12                  89.8      8.1      2.1
13                 100.0

Total               89.6      7.7      2.0      0.5      0.1      0.1

Note. The patterns of test taking reflected in these data for the
institutionally-tested (INST) samples differ markedly from those reported for
the basic first-time International and Special Center (I&SC) samples
involved in this study. Contributing factors are differences in the time
periods covered and differences in INST and I&SC patterns of test
administration.

* 01 (Taiwan), 02 (Hong Kong), 03 (Korea), 04 (Thailand), 05 (Japan), 06
(India), 07 (Saudi Arabia, Kuwait, Lebanon, Libya, Jordan, Syria, Iraq),
08 (Iran), 09 (Mexico, Venezuela, Peru, Colombia, Chile), 10 (Germany,
Netherlands, Norway, Denmark), 11 (France, Italy, Spain, Portugal), 12
(Greece, Turkey), 13 (Ghana, Nigeria).

** Row percentages should equal 100.0 within rounding limits.
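
The percentage distribution in Table 18 can be checked by dividing each count
by its row total. The following is a minimal sketch of that arithmetic, not
part of the original analysis; the counts for groups 05 and 03 are
transcribed from Table 18, and the names counts and row_percentages are
illustrative assumptions.

```python
# Counts of first-time INST examinees by number of times tested (1..6),
# transcribed from Table 18 for two illustrative groups.
counts = {
    "05": [1591, 210, 64, 17, 4, 5],
    "03": [762, 117, 16, 3, 0, 0],
}


def row_percentages(row):
    """Return each count as a percentage of the row total, to one decimal."""
    total = sum(row)
    return [round(100.0 * c / total, 1) for c in row]


pct = {group: row_percentages(row) for group, row in counts.items()}
for group, values in pct.items():
    # The rounded percentages need only sum to 100.0 within rounding limits.
    print(group, values, "sum =", round(sum(values), 1))
```

For group 05 the sketch returns 84.1, 11.1, 3.4, 0.9, 0.2, and 0.3 percent,
in agreement with the corresponding row of the table.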
ly scheduled I&SC test administration. It is assumed that the length of
the follow-up period was sufficient to cover the "tenure" of first-time
test takers as "members of the I&SC-examinee population."

The INST data, on the other hand, reflect test-taking behavior observed during
only a 12-month period, within the framework of patterns of test
administration that undoubtedly varied across institutions with respect to
frequency, purpose, degree of institutional control over examinee
participation, and so on. Moreover, individuals tested for the first time in an
INST administration in, say, July 1984, had a maximum of 12 months in
which to repeat the test, and an unknown number of ad hoc opportunities to
do so; individuals tested for the first time during the latter part of the
12-month study period, on the other hand, may have had no opportunity to
repeat.

Data in Table 19 point up expected differences between the INST and I&SC
repeater samples in mean time interval between first (time-1 or t-1) and last
(time-T or t-T) testing (mean = 4.2 months for INST and 11.9 months for I&SC)
and mean number of times tested (mean = 2.4 for INST and 2.9 for I&SC).

In addition, Table 19 shows that the great majority (about 85 percent) of
INST examinees were tested by institutions located in the U.S., while only
about 32 percent of their I&SC counterparts were in the U.S. when tested; for
8 of the 10 INST analysis groups, 90 percent or more of the repeaters were in
the U.S. at the time of testing.



Initial and Last Observed TOEFL Total Means, and Mean
Change, for Repeaters by Analysis Group

Table 20 shows the initial (time-1 or t-1) total score means and the last
observed (time-T or t-T) total score means for INST repeaters, the mean
interval between t-1 and t-T, and mean change ([t-T] - [t-1]), for the 10
analysis groups represented by 10 or more repeaters. Parallel data provided
for I&SC repeater samples in the corresponding analysis groups (from Table 15)
provide perspective for assessment of the findings. INST analysis groups
were ranked in terms of mean change (higher to lower) and I&SC analysis groups
were similarly ranked. The two sets of ranks are shown in Table 20.
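
The mean-change and ranking procedure just described can be sketched as
follows. This is an illustration only, not the authors' computation; the
t-1 and t-T means are the INST values as they can be read from Table 20, and
the names inst_means, change, and rank are assumptions introduced here.
Because the changes are recomputed from the printed means, they may differ
slightly from the printed differences.

```python
# (t-1 mean, t-T mean) for INST repeaters, as read from Table 20.
inst_means = {
    "01": (439.0, 466.3), "02": (468.2, 498.0), "03": (414.0, 438.6),
    "04": (430.1, 451.3), "05": (430.9, 464.6), "07": (406.3, 442.8),
    "08": (416.8, 470.2), "09": (434.0, 485.3), "11": (482.8, 516.6),
    "12": (406.5, 458.5),
}

# Mean change is [t-T] - [t-1] for each analysis group.
change = {g: round(last - first, 1) for g, (first, last) in inst_means.items()}

# Rank groups from the largest mean change (rank 1) to the smallest.
ordered = sorted(change, key=change.get, reverse=True)
rank = {g: i + 1 for i, g in enumerate(ordered)}

for g in ordered:
    print(g, change[g], rank[g])
```

Under these inputs group 08 (Iran) ranks first and group 04 (Thailand)
ranks tenth, as in the INST ranks of Table 20.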

Strong INST vs I&SC repeater population differences in level of
performance on TOEFL (consistent with differences in the performance of all
first-time test takers shown earlier in Table 17) are clearly evident. In every
analysis group the first-time TOEFL performance of INST repeaters was lower
than that of I&SC repeaters. The t-1 mean for all INST repeaters (424.1) was
some 50 points lower than that for the I&SC repeaters (474.6). Moreover, the
t-T mean for all INST repeaters (460.2) was some 14 points lower than the
t-1 mean for I&SC repeaters (474.6); this was true for six of the 10 analysis
groups.

Mean change for INST repeaters over time intervals averaging 4.2 months
was some 36 points. For the respective analysis groups, over intervals
averaging from 2.4 to 4.7 months, mean change varied from about 22 points (04
Table 19

Means on Selected Personal and Test-Related Variables for INST Repeaters,
by Analysis Group, with Comparative Data for I&SC Repeaters

                                     Variables*

Sample  Group       N   Intrvl  Numtsts  Edlevl    Sex   Center    Age

INST    01         29     4.7     2.3     0.66    0.6?    0.97    27.0
I&SC    01       3070    11.6     2.9     0.83    0.41    0.08    25.1

INST    02         11     3.6     --      0.45    0.36    0.92    23.6
I&SC    02       1781    13.7     3.2     0.26    0.34    0.06    19.3

INST    03        129     2.4     2.2     0.36    0.41    0.60    23.3
I&SC    03        819    11.7     2.8     0.76    0.26    0.26    25.3

INST    04         46     3.7     --      0.72    0.35    1.00    22.8
I&SC    04        733    12.9     3.0     0.35    0.44    0.32    23.8

INST    05        280     4.9     2.4     0.60    0.48    0.69    23.3
I&SC    05       1346    11.3     3.2     0.50    0.34    0.40    23.5

INST    07        227     4.3     2.5     0.58    0.07    0.96    23.0
I&SC    07        814    12.6     2.7     0.34    0.07    0.65    21.8

INST    08         65     4.3     2.2     0.63    0.37    1.00    22.1
I&SC    08       1500     9.9     2.7     0.31    0.21    0.79    21.3

INST    09        112     4.5     2.3     0.54    0.45    0.99    23.4
I&SC    09        566    10.1     2.5     0.60    0.26    0.69    23.9

INST    11         22     3.8     2.1     0.52    0.59    1.00    22.9
I&SC    11        101    10.6     2.2     0.78    0.24    0.22    24.0

INST    12         23     3.5     2.2     0.52    0.13    0.96    23.1
I&SC    12        307    10.2     2.6     0.56    0.16    0.34    21.4

INST    Total     954     4.2     2.4     0.56    0.36    0.85    23.2
I&SC    Total   11612    11.9     2.9     0.57    0.30    0.32    --

Note: Groups are 01 (Taiwan), 02 (Hong Kong), 03 (Korea), 04 (Thailand),
05 (Japan); 07 (Saudi Arabia, Kuwait, Jordan, Lebanon, Iraq, Syria); 08
(Iran); 09 (Chile, Colombia, Mexico, Peru, Venezuela); 11 (France, Italy,
Spain, Portugal); 12 (Greece, Turkey). No data are shown for 06 (India), 10
(Germany, Netherlands, Denmark, Norway, Sweden), and 13 (Ghana, Nigeria),
represented in the INST sample by fewer than 10 repeaters with
observations on the study variables. Data for I&SC repeaters are from Table 10.

* Intrvl (interval in months between t-1 and t-T); Numtsts (number of
times tested); Edlevl (educational level: graduate = 1, undergraduate = 0);
Sex (M = 0, F = 1); Center (last test administered in U.S. = 1, not in
U.S. = 0); Age in years at time of initial testing (t-1).

-- or ? marks a value or digit that is illegible in the source copy.
Table 20

Initial and Last Observed (t-T) TOEFL Total Means for Institutional
and I&SC Repeaters, Mean Time Interval Between t-1 and t-T, and
Mean Score Change, by Analysis Group

                            Mean                        Difference between means,
        Analysis            interval    t-1      t-T    time-1 to time-T, and
Sample  group*        N     (months)    mean     mean   rank of group
                                                        Diff      Rank**

INST    01           29       4.7      439.0    466.3   27.3      ( 8)
I&SC    01         3070      11.6      --       508.6   18.1      (10)

INST    02           11       3.6      468.2    498.0   29.8      ( 7)
I&SC    02         1781      13.7      --       --      29.5      ( 7)

INST    03          129       2.4      414.0    438.6   24.6      ( 9)
I&SC    03          819      11.7      --       ?21.1   25.6      ( 9)

INST    04           46       3.9      430.1    451.3   21.7      (10)
I&SC    04          733      12.9      454.4    485.1   30.7      ( 8)

INST    05          280       4.9      430.9    464.6   33.7      ( 6)
I&SC    05         1346      11.3      470.8    503.1   32.3      ( 6)

INST    07          227       4.3      406.3    442.8   36.5      ( 4)
I&SC    07          814      12.6      439.7    485.4   45.7      ( 3)

INST    08           65       4.3      416.8    470.2   53.4      ( 1)
I&SC    08         1500       9.9      434.4    488.6   54.2      ( 1)

INST    09          112       4.5      434.0    485.3   51.3      ( 3)
I&SC    09          566      10.1      465.8    513.9   48.1      ( 2)

INST    11           22       3.8      482.8    516.6   33.8      ( 5)
I&SC    11          101      10.6      530.7    566.9   36.2      ( 5)

INST    12           23       3.5      406.5    458.5   52.0      ( 2)
I&SC    12          307      10.2      473.0    518.0   43.0      ( 4)

ALL INST            954       4.2      424.1    460.2   36.1
ALL I&SC          11612      11.9      474.6    506.8   32.2

* Summary statistics are not reported for groups with N less than 10. Analysis
groups are 01 (Taiwan), 02 (Hong Kong), 03 (Korea), 04 (Thailand), 05 (Japan),
07 (Saudi Arabia, Iraq, Lebanon, Syria, Kuwait, Jordan), 08 (Iran), 09
(Mexico, Colombia, Chile, Peru, Venezuela), 12 (Greece, Turkey), 11 (France,
Italy, Spain, Portugal). Data not shown separately for 06 (India), 10 (Germany,
Netherlands, Norway, Denmark, Sweden), and 13 (Ghana, Nigeria), due to N less
than 10, are included in the respective totals.

** These are ranks of groups in terms of differences between the means shown.
INST and I&SC analysis groups were ranked separately from highest to lowest in
terms of these mean difference values. Thus, for example, among INST
repeaters in the 10 analysis groups, those from Taiwan ranked eighth in terms of
mean score change while among I&SC repeaters, those from Taiwan ranked tenth.

-- or ? marks a value or digit that is illegible in the source copy.
Thailand) to over 50 points (08 Iran; 09 Mexico, Chile, Colombia, Peru,
Venezuela; 12 Greece, Turkey).

To the extent that, as assumed, these institutional repeaters were
enrolled in ESL instruction between the t-1 and the t-T test administrations,
these mean changes may be thought of as suggesting patterns of average gains
to be expected over comparable periods of time, for similar members of these
analysis groups who normally elect or are selected into programs of ESL
instruction such as those represented in the sample.

It is noteworthy that the analysis groups that demonstrated more (less)
mean change under the more controlled between-administration INST conditions
tended to be those that demonstrated more (less) mean change under the varied
conditions prevailing between t-1 and t-T for I&SC repeaters. The ranking of
INST analysis groups corresponded quite closely to the ranking of I&SC
analysis groups in terms of mean change (rho = .89). Such systematic covariation,
especially in view of the relatively small size of most of the INST repeater
samples and clear population differences in TOEFL performance, suggests basic
differences among analysis groups (associated with differences in
linguistic-cultural background) in the rate of acquisition of proficiency in
English as a second language.



Predicting t-T Total Score for INST Repeaters

Being in the U.S. (immersed in an English-speaking environment) when
tested (1 vs 0), time interval in months between first and last tests, number
of times tested, age, sex (F = 1, M = 0), and educational level (graduate = 1, vs
undergraduate = 0) were treated as independent variables, and final (t-T)
TOEFL total score as the dependent variable, in a regression analysis based on
data for the 954 INST repeaters. Interval between tests and being in the U.S.
were the primary nontest contributors to prediction of final TOEFL score.
Standard partial regression weights from the INST analysis and those from a
parallel I&SC regression analysis are shown below:

Variable                                 Standardized weight

                                          INST        I&SC

Time-1 total score                         .80         .77
Interval in months (Intrvl)                .13         .12
U.S. testing (1,0)                         .10         .11
Number of tests taken (Numtsts)            .03         .09
Age                                       -.04        -.09
Sex (coded 1, 0)                          -.00        -.05
Educational level (Edlevel)                .04        -.01

Simple correlation of t-1
  score with t-T score                    (.78)       (.72)

Multiple correlation                       .80         .76

Standard error of estimate               35.00       35.30

Number of cases                            954      11,612
In evaluating the diminished role of number of times tested as a
predictor of final TOEFL score (or change) in the INST analysis as compared to the
I&SC analysis, it should be kept in mind (a) that INST repeaters had fewer
opportunities to repeat and (b) that the meaning of this variable may be
somewhat different for the INST samples than for I&SC samples. It is assumed
that in the I&SC context, number of testings is more amenable to control by
individual examinees (reflecting, say, motivation, effort, resources); in the
INST context, on the other hand, number of testings is assumed to be more
under the control of the testing institutions.



The regression analysis provided an equation for predicting t-T TOEFL
total score for INST repeaters, generally, taking into account t-1 TOEFL
score, interval between tests, number of times tested, and the other study
variables. The total-sample regression equation was as follows:

t-T TOEFL score = .890 (t-1 score) + 2.357 (Numtsts) + 2.981 (Intrvl)
                  + 15.604 (U.S.-testing) - .535 (Age) - .452 (Sex)
                  - 4.179 (Edlevel) + 66.146.
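
As a check on the equation, it can be evaluated at the INST total-sample
means given in Table 19. The sketch below is illustrative only; the function
name predict_t_T and its parameter names are assumptions introduced here.
The sign of the Age term is not legible in the source copy and is taken as
negative, an assumption consistent with the negative standardized weight
for Age.

```python
def predict_t_T(t1_score, numtsts, intrvl, us_testing, age, sex, edlevel):
    """Predicted t-T TOEFL total score from the INST total-sample equation.

    The negative sign on the age term is an assumption (illegible in the copy).
    """
    return (0.890 * t1_score
            + 2.357 * numtsts
            + 2.981 * intrvl
            + 15.604 * us_testing
            - 0.535 * age
            - 0.452 * sex
            - 4.179 * edlevel
            + 66.146)


# INST total-sample means: t-1 mean from Table 20, the others from Table 19.
inst_total_means = dict(
    t1_score=424.1, numtsts=2.4, intrvl=4.2, us_testing=0.85,
    age=23.2, sex=0.36, edlevel=0.56,
)

predicted = predict_t_T(**inst_total_means)
print(round(predicted, 1))  # about 460.1
```

Evaluated at these means the equation gives about 460.1, close to the
observed t-T mean of 460.2 for all INST repeaters, as would be expected of a
least-squares equation fitted to the total sample.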

Following the same procedures that were employed in the analysis of I&SC
repeaters (see Table 15 and related discussion), the regression equation based
on data for INST repeaters without regard to analysis group was used to
compute estimated t-T TOEFL total means for each of the 10 analysis groups.
Table 21 shows the difference between the observed and the expected means for
each INST analysis group. Positive discrepancies indicate that a group's t-T
mean was higher than expected while negative discrepancies indicate the
opposite. Analysis groups are ranked according to the difference between
observed and expected means, algebraically, high to low. Results of the
comparable discrepancy analysis for I&SC repeaters and a set of ranks for the
corresponding I&SC analysis groups are also shown in Table 21.

The ranking of INST analysis groups in terms of performance relative to
expectancy corresponded closely to the ranking of the I&SC samples from the
same analysis groups (rho = .888 for the two sets of ranks). This result is
consistent with the finding, previously reported, of close correspondence
between rankings of analysis groups in the INST and the I&SC samples in terms
of average change in TOEFL total score between initial and last observed test
administrations. It thus lends further support to the inference that the
observed analysis-group differences reflect basic group differences in
characteristic rate of acquisition of proficiency in English as a second
language.
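
The rank-order coefficient (rho) cited above can be illustrated with a short
sketch. It is not the authors' computation; the function name
rank_correlation is an assumption, and the ranks are those that can be read
from this copy of Table 21, in the group order 12, 08, 09, 11, 05, 02, 03,
07, 01, 04. Because some entries in the copy are hard to read, the result
should be taken as an illustration of the method rather than a reproduction
of the reported value.

```python
from math import sqrt

# Ranks as read from Table 21 (tied groups share the average rank 6.5).
inst_rank = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
isc_rank = [3, 4, 2, 1, 6.5, 6.5, 5, 8, 9, 10]


def rank_correlation(x, y):
    """Pearson correlation of two rank lists, i.e. Spearman's rho.

    Computing the product-moment correlation on the ranks themselves
    handles tied ranks without a separate correction.
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


rho = rank_correlation(inst_rank, isc_rank)
print(round(rho, 3))
```

With the ranks as read here the sketch gives roughly .85, somewhat below
the .888 reported in the text; the gap may reflect entries that are misread
or illegible in this copy.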



The findings reported in this section are based on data from files
for a sample of individuals in selected national-linguistic analysis groups



Analysis Group Differences in Observed vs Expected t-T
TOEFL Total Means
Table 21

Difference between Observed and Predicted t-T TOEFL Total Means
for INST and I&SC Analysis Groups

                 Observed mean minus expected mean and
                     corresponding rank of group*

                      INST                          I&SC
Group**         N     Diff     Rank           N     Diff     Rank

12             23    +14.2     ( 1)         307    +10.4     ( 3)
08             65    +13.7     ( 2)        1500    + 5.6     ( 4)
09            112    +13.2     ( 3)         566    +11.4     ( 2)
11             22    + 3.0     ( 4)         101    +24.0     ( 1)
05            280    - 1.3     ( 5)        1346    - 1.1     (6.5)
02             11    - 1.5     ( 6)        1781    - 1.1     (6.5)
03            129    - 4.0     ( 7)         819    + 2.3     ( 5)
07            227    - 4.3     ( 8)         814    - 1.9     ( 8)
01             29    - 8.0     ( 9)        3070    - 3.9     ( 9)
04             46    -14.6     (10)         733    - 7.4     (10)

* Expected means for the INST and the I&SC analysis groups were based
on the respective total-sample regression equations.

** 12 (Greece, Turkey), 08 (Iran), 09 (Mexico, Chile, Colombia, Peru,
Venezuela), 11 (France, Italy, Spain, Portugal), 05 Japan, 02 Hong
Kong, 03 Korea, 07 (Saudi Arabia, Jordan, Kuwait, Lebanon, Iraq,
Syria), 01 Taiwan, 04 Thailand.




who took TOEFL initially in an institutional test administration between July
1984 and August 1985. About 10 percent of these individuals accumulated at
least one additional test record during that period; of these repeaters, about
80 percent had only one additional record in the study file for the period
under consideration. The mean number of tests per repeater was 2.4, and the
average time interval between the initial test administration and the last
observed record was 4.2 months. Some 85 percent of the repeaters were tested
by institutions located in the United States.

For 954 INST repeaters with data on all study variables, the average
amount of change in TOEFL total score, over t-1 to t-T intervals averaging
slightly more than four months, was 36 scaled-score points. Time interval
between t-1 and t-T, and being enrolled in a U.S. rather than a non-U.S.
institution when tested were the nontest variables contributing most to prediction
of t-T TOEFL total score (and change, t-1 to t-T, by inference) for the INST
repeaters.

Mean change in TOEFL total was greater for some analysis groups than for
others; groups registering higher average gains in score tended to have higher
t-T means than expected for the average INST repeater with similar time-1
scores and nontest characteristics (that is, t-1 to t-T time interval, number
of times tested, U.S. vs other institution, age, sex, and educational
level: graduate vs undergraduate).

Whether based on mean change in TOEFL total score or the mean difference
between observed and expected t-T scores, the ranks of national-linguistic
groups among INST repeaters were found to correspond closely to the ranks for
the same national-linguistic groups among International and Special Center
(I&SC) repeaters. These systematic relationships obtained despite the fact
that (a) most of the INST analysis groups were quite small (with potential for
sampling error), (b) the circumstances associated with test repetition are
quite different in the two test populations, and (c) there are clear population
differences in initial level of proficiency (as measured by TOEFL). The
consistency of findings thus strengthens the inference of differences among
national-linguistic groups in characteristic rate of acquisition of proficiency
in English as a second language.

It is assumed that the institutional repeaters were taking special
instruction in English as a second language between test administrations. To the
extent that this assumption is valid, the pattern of differences in average
gains in TOEFL performance for INST analysis groups in these samples may be
thought of as indicative of the pattern of differences that might be expected
in similar groups of examinees who may participate in "typical" programs of
special ESL instruction, under "typical" program conditions, over time
intervals averaging about four months. Of course, the fact that most of the
analysis groups were relatively small limits the accuracy of inferences from
these data regarding the specific "average gains" to be expected for various
groups.

Research is needed to obtain more detailed information regarding the
actual circumstances of individuals who repeat the TOEFL in institutional test
administrations, the characteristics of instructional programs involved, the
conditions of test administration, and so on. Assuming the availability of
such information, test-retest data from TOEFL Institutional Program files
could be used to provide evidence regarding typical patterns of TOEFL score
change for various national-linguistic groups in institutional settings
classified according to, say, typical duration of ESL program, patterns of
instruction, emphasis on English for general purposes as compared to English
for specific academic or occupational purposes, and so on.